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Reply to all 


Scientists discussing their work through written media, including e-mail, should be aware that 
they could at any time be asked to reveal their conversations. 


data, and/or communications.” Towards the end of last year, 

those orders appeared in a subpoena that landed at the Woods 
Hole Oceanographic Institution in Massachusetts. The energy firm BP 
demanded that Woods Hole produce e-mails and other documents 
related to its research on the 2010 Deepwater Horizon oil spill in the 
Gulf of Mexico. Woods Hole fought the sweeping request, but a US 
district court has now forced researchers at the institute to surrender 
thousands of e-mails. That decision has disturbing implications for 
science in the United States, although the situation is perhaps not as 
dire as some have warned. 

The demand for the e-mails emerged from a huge lawsuit, in which 
BP is being sued by the US government and others affected by the oil 
spill. As part of that suit, the company faces fines of up to US$4,300 
per barrel of oil spilled, which could amount to more than $17 billion 
if the court sides with government estimates for the size of the spill. BP 
argues that these estimates — much higher than the company’s own 
— rely heavily on research conducted by Woods Hole. And it claims 
that aspects of the work involve “puzzling, apparently arbitrary, sus- 
piciously offsetting, and highly significant decisions by Woods Hole 
researchers”. 

Woods Hole is not a party to the lawsuit, but it did have a contract 
with the government during the first month of the spill to measure 
how much oil and gas was spewing from the broken well. After that, 
researchers from the institute worked to refine their analyses and 
publish papers on the spill in academic journals. One Woods Hole 
scientist also participated in a governmental panel asked to estimate 
the size of the spill. 

To defend itself in court, BP contends that it needs more than just 
the raw data collected by Woods Hole anda description of the methods 
used by its researchers. Woods Hole turned over much of the data and 
analysis tools to BP on its original request, but it fought against sur- 
rendering confidential academic communications, arguing that those 
have been protected in the past by courts, which have recognized the 
importance of ‘scholastic privilege’ Indeed, the judge who ruled over 
Woods Hole’s arguments recognized the principle of scholastic privi- 
lege regarding confidentiality, but only up to a point. She found that 
BP hada compelling need for the Woods Hole e-mails and other com- 
munications for the period before the government-led group issued 
its report in March last year. The e-mails handed to BP could be made 
public if the case goes to trial. 

But the judge did not allow BP access to communications after that 
time, when the Woods Hole researchers were preparing academic pub- 
lications. “If BP gains access to the analysis documents for these article 
[sic], that could hamper future research efforts,” she said. 

The decision pleased neither BP nor Woods Hole. BP appealed, 
seeking all the documents it requested, but was unsuccessful. Woods 
Hole contends that surrendering the e-mails will have a chilling effect 


¢ C Ye are commanded to produce... any and all documents, 


on science, by forcing researchers to avoid topics that could become 
subjects of litigation. Is the institute right? It is hard to judge the scale 
of the potential damage the decision could cause. Even well before the 
subpoena, some researchers declined to study the spill, in part because 

of the legal risk, so the decision may not make 


“Itis hard to much difference on that front. Researchers 
judge the scale who work in other contentious fields, such 
of the potential as animal experimentation, are not gener- 
damage the ally scared off their work, even by threats of 
decision could physical harm. 

cause.” The decision on the Woods Hole e-mails 


should nonetheless serve as a warning. 
Researchers who choose to study topics that have multiple and con- 
flicting implications for powerful special interests must go in with 
their eyes open and be aware of the risks. Not only have Woods Hole 
researchers had to surrender their e-mails, they have also spent hun- 
dreds of hours complying with the subpoena. And the case has drained 
considerable funds from the non-profit institution through lawyers’ 
fees and lost staff time. 

The decision provides yet another reminder that scientists should 
not regard e-mails as being private. Whether by theft or by court 
order, such communications can be taken away. Think before you 
press Send. = 


Calling Nordic 
mentors 


Be year, Nature awards prizes to senior researchers in the 
natural sciences for sustained and outstanding scientific 
mentoring of young researchers. There are two prizes, each 
worth €10,000 (US$12,500): one for lifetime achievement, one 
for mid-career achievement. 

The competitions are organized on a country or regional 
basis. We shall give this year’s accolades to researchers who live 
in the Nordic countries: Sweden, Norway, Denmark, Finland, 
Iceland and Greenland. 

Nominations are now open, with a deadline of Monday 
13 August. Each applicant should be nominated by five people 
who have been mentored by her or him over different periods. 
The forms for nominations, and other essential details about 
the competition, including the list of judges, can be found at 
go.nature.com/bxzc9y. 
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WORLD VIEW jensen 


whether these methods eventually succeed in countering climate 

change, and whether they will be embraced by the wider popula- 
tion, concerns more than scientists alone. That is why, in the wake of 
the cancellation last month of the Stratospheric Particle Injection for 
Climate Engineering (SPICE) field trial and the attendant publicity, it 
is important that the correct lessons are drawn by scientists, funders, 
regulators and politicians alike. 

In an Editorial about SPICE, this publication pointed out that geo- 
engineers must “grasp the nettle of regulation and oversight” (see 
Nature 485, 415; 2012). Itadded that community-proposed guidelines 
such as the Oxford Principles need practical implementation frame- 
works to make an impact. Both issues do demand 
urgent attention, but there is a more important 
consideration: something that geoengineers com- 
ing together to “draft detailed, practical actions’, 
as the Editorial recommended, simply cannot 
address. Geoengineering will alter the geopolitics 
of climate change and this cannot be ignored by 
climate policy-makers. 

SPICE turned the focus of the geoengineering 
debate onto patenting and concerns that personal 
or corporate ownership could take precedence 
over global public interest. But patents are far 
from the biggest issue for techniques such as solar 
geoengineering (blocking the Sun’s radiation, as 
investigated by the SPICE project). 

The cost of injecting enough aerosols into the 
stratosphere to counter the warming projected for 
even high-emission scenarios is estimated to be 
only about US$1 billion per year. That amounts 
to less than $0.01 per year to compensate for each 
tonne of carbon dioxide emitted. And most of the necessary technology 
exists already, such as high-altitude aircraft (of which only a couple of 
dozen would be needed). Inventors of solar-geoengineering methods 
might try to charge large sums to license patents on their ideas, but this 
will not create a lucrative new industry for large corporations. 

That said, existing industries, especially agriculture and energy, will 
have a strong interest in whether and when solar geoengineering is 
used, given that these methods could have sweeping effects on climate, 
both desirable and undesirable. But industry influence will occur 
much more through political lobbying than through patent ownership. 

At present, the power in climate negotiations is squarely in the hands 
of the major carbon-emitting nations. Technically, a coalition of vulner- 
able nations — say, low-lying island states threat- 


cientists are developing geoengineering technologies. But 


ened by rising seas — might be able to muster OD NATURE.COM 
$1 billion per year to reverse global warming, _ Discuss this article 
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Researchers can’t regulate 
climate engineering alone 


Political interests, not scientists or inventors, will be the biggest influence on 
technologies to counter climate change, says Jason Blackstock. 


geopolitical calculations. We scientists know this. We have consulted 
with civil society, the private sector and government officials through 
the 2010 Asilomar Conference in California — which worked on pol- 
icy recommendations for geoengineering — and the ongoing Solar 
Radiation Management Governance Initiative launched by the Royal 
Society in London. Both have indicated the need for action beyond 
geoengineering researchers. But if assembling these groups has not yet 
generated the necessary ‘practical actions, then what will? 

Geoengineering researchers can experiment with ways to translate 
guidelines into a practical regulatory framework, beginning with a vol- 
untary registry of ongoing and planned research. But responsibility also 
rests with funders. More research is needed to ensure that we understand 
the abilities and limits of solar-geoengineering 
technology before it is deployed in political des- 
peration. It would bea tragedy if the negative pub- 
licity around SPICE scared funds away from this 
work. Funders can also help to establish norms 
of international cooperation. First steps would 
include creating incentives for collaboration on 
research, and sharing lessons on how to respon- 
sibly oversee and monitor controversial projects. 

Ultimately, climate policy-makers need to 
get involved. Current international institutions 
are simply not prepared for geoengineering. 
Discussing solar geoengineering at the United 
Nations climate talks now would only disrupt 
progress towards the already ambitious goal of 
agreeing a global mitigation and adaptation strat- 
egy by 2015. And the Convention on Biological 
Diversity, the only international body yet to 
tackle the issue of solar geoengineering directly, 
has neither the mandate nor sufficient political 
clout to broker a geopolitical agreement. 

Climate negotiators and political leaders need to develop strate- 
gies to fill the governance gap. They also need to consider the signals 
that domestic funding of geoengineering research sends about future 
climate-policy intentions. Failure to come to grips with these issues 
could lead to problems if events such as geoengineering field experi- 
ments outpace political preparations. 

Greater political engagement may be uncomfortable for climate 
scientists. Climate science has fought hard to resist agendas that force 
particular research or conclusions. But such agendas — both corpo- 
rate and national — do influence climate policy, and geoengineering 
technologies could magnify that influence. Politics can’t be avoided, 
and ignoring it is dangerous for all of us. m 


Jason Blackstock is a visiting fellow with the Institute for Science, 
Innovation and Society at the University of Oxford, UK. 
e-mail: jason. blackstock@insis.ox.ac.uk 
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RESEARCH HIGHLIGHTS sic 


Religion as 
cuckoldry defence 


How can patriarchal societies 
be sure that the men who 
inherit their fathers’ goods are 
actually their fathers’ sons? 
Religions help men to be more 
confident in their paternity by 
setting standards for female 
sexual behaviour. 

Beverly Strassmann at the 
University of Michigan in Ann 
Arbor and her team studied 
genetic data from 1,706 father- 
son pairs among the Dogon of 
Mali. Although many Dogon 
are Christian or Muslim, the 
adherents of their indigenous 
religion had the lowest rate of 
male births due to cuckoldry: 
1.3% versus 2.9% for the other 
faiths. The researchers chalk 
this up to the traditional huts 
where menstruating women 
must spend their nights. In a 
society in which women are 
often pregnant or unable to 
conceive owing to intensive 
breast-feeding, a sojourn in the 
hut informs a woman's husband 
and his kin that she has become 
fertile again. The family can 
then increase their vigilance 
towards her. 

Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1110442109 (2012) 


Synthetic silk 
inspired by insect 


The green lacewing (Mallada 
signata), an insect used in 
biological pest control, has 
inspired the creation of an 
artificial silk. 

To protect its progeny from 
predators, the female lacewing 
suspends its eggs from the 
underside ofa leaf using a thin 
silk stalk of remarkable tensile 
strength. To forge a similar 
stalk, Thomas Scheibel and 
Felix Bauer at the University 


Weighing extinct animals 


Knowing an extinct animal's mass is crucial for estimating its 
physiological traits but the standard technique — in which 

a model of the animal is made and its mass then calculated 
from its density — has been criticized for being too subjective. 

Bill Sellers at the University of Manchester, UK, and his 
team laser-scanned the skeletons of 14 large mammals, 
including the bison and the elephant. The researchers used 
the scans to mathematically derive ‘convex hulls’ of the bones 
— the minimum volume that encloses a set of points, akin to 
gift-wrapping a teapot — and converted these volumes into 
estimates of mass. These were then compared to the known 
values. 

The method consistently underestimated true body mass 
by 21%. Thus, using this method and then adding 21% should 
provide more accurate predictions. When the team used 
the corrected technique on Giraffatitan brancai, one of the 
largest dinosaur skeletons in the world, the beast clocked in at 
23,200 kilograms, similar to recent volumetric estimates. 

Biol. Lett. http://dx.doi.org/10.1098/rsbl.2012.0263 (2012) 
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of Bayreuth in Germany 
developed a synthetic protein 
based on a repeating amino- 
acid pattern found in one of 
the proteins in lacewing silk. 
The researchers introduced a 
synthetic gene that codes for 
the protein into Escherichia coli 
bacteria. 

Filaments of the bacteria- 
produced silk proved to be 
almost as strong as natural 
lacewing silk, except at high 
humidity, and more elastic. 
Angew. Chem. Int. Edn. 
http://dx.doi.org/10.1002/ 
anie.201200591 (2012) 


Live birth without 
a placenta 


Some sharks and rays give 
birth to live young but lack 
placentas or umbilical cords. 
So how do their embryos 
obtain oxygen? 

Taketeru Tomita of the 
Hokkaido University Museum 
in Hakodate, Japan, and his 
team used ultrasound to study 
a pregnant manta ray (Manta 
alfredi) to find out. They found 
that the embryo rhythmically 
raises and lowers its jaw, 
pumping uterine fluid into 
its mouth and through holes 
behind its eyes called spiracles. 
After birth, the spiracles shrink 
as the animal becomes able 
to force water over its gills by 
swimming. 

Biol. Lett. http://dx.doi. 
org/10.1098/rsbl.2012.0288 
(2012) 


DRUG DISCOVERY 


Melanoma 
pathway targeted 


A drug that targets the RAS 
signalling cascade — which 
is thought to drive tumour 
growth in 20% of cancers — 
improves survival in some 
patients with advanced 
melanoma. 


W. SELLERS 


UNIVERSITY OF TUBINGEN 


The drug, called trametinib, 
inhibits MEK, a protein that 
amplifies cancer-promoting 
signals. Keith Flaherty at 
the Massachusetts General 
Hospital in Boston and his 
team tested trametinib in 
melanoma patients with a 
mutated version of a RAS- 
pathway protein BRAF, 
which is mutated in 50% of 
advanced melanoma patients. 
A total of 322 patients were 
randomly assigned to receive 
either trametinib or standard 
chemotherapy. 

Twenty-two percent 
of patients responded to 
trametinib, which delayed 
disease progression by 
3 months longer than 
chemotherapy alone. 

N. Engl. J. Med. http://dx.doi. 
org/10.1056/nejmoa1203421 
(2012) 


Cultural 
wellspring 


Certain forms of art and music 
might have emerged among 
ancient humans living in 
southern Germany and then 
spread through Europe along 
the Danube River. 

The Aurignacian culture, 
which is characterized by 
innovations such as figurative 
art and musical instruments 
(mammoth-ivory flute 
pictured), originated in 
Europe more than 40,000 years 
ago. However, imprecision 
in carbon dating has made 
it difficult to determine 
precisely where the culture 
arose. Thomas Higham at 
the University of Oxford, 

UK, Nicholas Conard at 

the University of Tubingen, 
Germany, and their team used 
improved sample preparation 
techniques for carbon dating 
to study Aurignacian remains 
from Geifenklésterle cave in 
southern Germany’s Swabian 
Jura region. 

The analysis revealed that 
the artefacts at Geifenklésterle 
are about 42,500 years old. 
This pre-dates other recently 


dated Aurignacian sites in 
Europe. 

J. Hum. Evol. http://dx.doi. 
org/10.1016/j.jhevol.2012.03.003 
(2012) 


EXTRASOLAR PLANETS 


A new world 
is born 


Astronomers have identified a 
planet in the making. 

The variable brightness of 
GM Cephei — a 4-million- 
year-old star 870 parsecs away 
— has been noted for decades. 
But whether that variability is 
caused by bursts of light from 
the star or by an object in the 
disk of dust that surrounds it 
has been a matter of debate. 

Wen-Ping Chen at the 
National Central University 
in Jhongli, Taiwan, and his 
team argue for the latter. Using 
telescopes positioned around 
the world, the researchers 
monitored GM Cephei from 
2009 to 2011 and found that the 
brightness of the star dipped 
each year for about a month. 
The researchers interpret this 
variability as evidence ofa 
young planet: a clump of dust, 
roughly the mass of an asteroid, 
which is orbiting GM Cephei. 
Astrophys. J. 751: 118 (2012) 


METABOLISM 


A pathway for 
feeding control 


The activation of a group of 
neurons in a region of the 
brain called the hypothalamus 
boosts food intake, but what 
controls this activation has 
been unclear. Researchers have 
now identified a receptor in 
these neurons that is targeted 
by a protein called FoxO1 to 
promote feeding. 

The FoxO1 protein is 
involved in regulating the 
signalling of key hormones 
that suppress appetite. 
Domenico Accili at Columbia 
University in New York and 
his team deleted FoxO1 from 
mouse hypothalamic neurons 
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COMMUNITY 


CHOICE 


EVOLUTION 


Domesticated apes 


> HIGHLY READ 

on journals.elsevier 
com/animal-benaviour 
March-May 


Selection against aggression seems 

to have occurred naturally in some 
animals and to have led to traits similar 
to those seen in domesticated animals. 


The bonobo (Pan paniscus) is 

less aggressive and more sociable than its sister species, the 
chimpanzee (Pan troglodytes). Brian Hare at Duke University 
in Durham, North Carolina, and his team propose that 
these differences, along with a suite of physical variations — 
bonobos have smaller childlike heads, plus paler lips and tails, 
than chimps — are a linked set of traits that parallel those 
seen in domesticated animals such as dogs and guineapigs. 
Bonobos are, the authors argue, a self-domesticated ape. 

Selection pressures that might have favoured bonobo self- 
domestication include the acquisition of bigger territories, 
reducing competition for food and the emergence of 
coalitions of females that enforced the peace. 

Domestication, far from being a human invention, may 
also occur spontaneously in nature, they say. 


Anim. Behav. 83, 573-585 (2012) 


and found that the animals 
were leaner, ate less and had 
improved glucose and fat 
metabolism compared with 
normal mice. The researchers 
pinpointed the receptor Gpr17 
as mediating the effects of 
FoxOl. 

Targeting this pathway could 
bea strategy for anti-obesity 
drugs, the authors say. 

Cell 149, 1314-1326 (2012) 


Ship GPS could 
flag tsunamis 


Commercial ships on Earth’s 
oceans could provide a 
cheap and easy way to track 
propagating tsunamis. 
Current warning systems 
rely on sparse, expensive 
buoys and underwater 
sensors that track a wave once 
it has been triggered by an 
earthquake. James Foster at the 
University of Hawaii at Manoa 
in Honolulu and his team 
examined Global Positioning 
System (GPS) data froma 
research vessel (pictured) that 
was heading from Hawaii to 
Guam in February 2010 when 


an underwater earthquake 
occurred off the coast of 
Chile. Filtering the data on 
the basis of tsunami models 
allowed the researchers to 
differentiate between choppy 
seas and changes in ocean 
surface height due to a tsunami. 
The team detected a roughly 
10-centimetre rise as the 
tsunami passed, and estimated 
wave speed and arrival time. 
Although data from a single 
ship could be prone to false 
positives, recruiting thousands 
of ships could overcome this 
problem, the authors suggest. 
Geophys. Res. Lett. http://dx.doi. 
org/10.1029/2012GL051367 
(2012) 
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Nuclear restarts 


Two of Japan’s shuttered 
nuclear reactors may soon 
reopen, having received a 
safety thumbs-up froma 
scientific panel appointed by 
regional government. The 
reactors, at the Ohi plant in 
Fukui prefecture, had already 
passed safety tests in January. 
But public protests (and some 
nuclear analysts) opposed 
their restart. On 8 June, Prime 
Minister Yoshihiko Noda 
appeared on public television 
asking for his nation’s support 
to reopen the nuclear reactors, 
and on 10 June, a 12-member 
scientific panel appointed 

by the Fukui prefecture’s 
governor reported that the Ohi 
plant could be operated safely. 


X-ray mission axed 
NASA has cancelled a small 
astrophysics mission that 
would have studied the 
polarized X-rays streaming 
from black holes and neutron 
stars. The Gravity and Extreme 
Magnetism Small Explorer 
(GEMS) had been selected 

for funding in 2009 and 

was scheduled to launch in 
2014, but was estimated to 

be running 20-30% over its 
US$119-million budget by the 
time it was axed on 7 June. See 
go.nature.com/bit1rn for more. 


Altering embryos 
Reproductive procedures that 
would spare children from 
inheriting mitochondrial 
diseases received approval from 
an influential UK bioethics 
body on 12 June. The technique 
involves transferring genetic 
material from an egg cell 
containing faulty mitochondria 
to an egg from another woman 
that has been stripped of its 
nucleus (see Nature 481, 419; 
2012). The London-based 
Nuffield Council on Bioethics 
found that these procedures are 


Great Barrier Reef protection row 


State and federal governments in Australia 

are clashing over the approval of industrial 
developments that could damage the Great 
Barrier Reef. On 5 June, environment minister 
Tony Burke delayed the approvals process 

for a multibillion-dollar coalmining project 

in Queensland, calling the state’s operation 
“shambolic’. Three days earlier, a report from 


ethically acceptable, if proven 
safe and effective by additional 
research currently under way, 
and if conducted as clinical 
trials. The UK government 
begins a public consultation 

in September, a first step to 
legalizing the process. See 
go.nature.com/fpn5bo 

for more. 


Environment gloom 
Only four of the 90 most 
important internationally 
agreed environmental 

goals have seen “significant 
progress’, the United Nations 
Environment Programme 
warned on 6 June. Its fifth 
Global Environment Outlook 
is the UN’s main status report 
on the health of the world’s 
ecosystems, and was released 
two weeks ahead of the Rio+20 
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summit in Brazil. Only in the 
areas of stopping ozone-layer 
depletion, removing lead in 
fuel, improving access to water 
and marine pollution research 
has there been any major 
advance in line with promised 
objectives, the report says. 


Cancer fund 

An attempt in California to 
create a cancer-research fund 
from the proceeds of tobacco 
taxes looked to have been 
narrowly defeated, but was 
still too close to call, after a 
state vote on the measure on 
5 June. Proposition 29 would 
add a US$1 tax to every pack 
of cigarettes and generate 
around $735 million by 
2013-14 for cancer research, 
smoking prevention and 
smoking cessation. The ‘yes’ 
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the United Nations Educational, Scientific and 
Cultural Organization (UNESCO) had said that 
the reef was threatened by the rapid increase 

in coastal development. UNESCO's Heritage 
Committee is meeting from 24 June to consider 
the report's findings — which stopped short of 
recommending a potentially embarrassing ‘in 
danger designation. 


side had garnered 49.2% of 
the vote, just 63,000 behind 
those rejecting the idea — 
but hundreds of thousands 
of postal votes are yet to be 
counted. See go.nature.com/ 
yck8eu for more. 


Piezonuclear row 
More than 1,000 scientists 
have signed an Internet appeal 
to Italy’s research ministry 
requesting that the National 
Institute for Meteorological 
Research (INRiM) in Turin 
withdraw its intention to study 
‘piezonuclear reactions, nuclear 
fission that can supposedly 

be induced by mechanical 
stress such as compression of 
crystals. Alberto Carpinteri, 

a prominent proponent of 
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SOURCE: PLOS ONE 


the controversial idea, was 
appointed president of INRiM 
last August, and introduced 

the theme into the institute’s 
2012-14 research plan, which 
was published on 29 February. 
On 6 June, parliamentarians 
also asked the research minister 
about INRiM’s use of public 
funds. 


Thawed brains 

About 150 brains donated for 
research — including more 
than 50 from people with 
autism — may no longer be 

fit for study after a freezer 
malfunction at the Harvard 
Brain Tissue Resource 

Center at McLean Hospital 

in Belmont, Massachusetts. 
After the refrigeration system 
and two independent alarms 
all failed, an investigator 
discovered the calamity on 

31 May. On 11 June, staff 

at Autism Speaks, a non- 
profit organization based in 
New York City that leads a 
programme to distribute brain 
samples to autism researchers, 
alerted the families of donors to 
the situation. An investigation 
into the freezer breakdown is 
to be completed by the end of 
June. See go.nature.com/ful4vi 
for more. 


Sci-fi writer dies 
Ray Bradbury (pictured), 
the acclaimed author of 

the science-fiction classics 


The open-access journal PLoS 
ONE has swiftly expanded since 
its launch in late 2006 to become 


the world’s largest journal; it 
has already published more 


than 9,000 articles this year (see 
chart). Its growth has encouraged 


the launch of an innovative 
open-access journal, Peer], 


which announced its business 
model on 12 June. But instead 
of charging authors per article, 
a one-off membership fee will 
give them the rights to publish 


peer-reviewed articles in Peer] for 


life. See page 166 for more. 


The Martian Chronicles and 


Fahrenheit 451, among other 
works, died on 5 June aged 
91. Bradbury’s works tapped 
both scientific wonder and the 
unease about technological 
change, exploring how 
science and technology shape 
human identity. His stories 
anticipated flat-screen TVs, 
personal stereos and the 
‘butterfly effect’ of chaos 
theory — but, in an oft- 
repeated remark, he said that 
he tried not to predict the 
future, but to prevent it. 


Stealing secrets 


A nanotechnology researcher 
who until recently worked at 
Sandia National Laboratories 
in Albuquerque, New 
Mexico, has pleaded not 
guilty to embezzling and 
passing research information 
to institutions in China. 
Jianyu Huang was charged 
on 5 June with five counts of 
federal-programme fraud 
from 2009, as well as falsely 
telling a counterintelligence 
officer that he would not take 


RISE OF OPEN ACCESS 


a Sandia-owned laptop on a 
trip to China. Sandia said that 
Huang had been fired in April 
for violating procedures, and 
his facility did not have access 
to classified national-security 
information. 


Heartland fallout 
The Pacific Institute in 
Oakland, California, has 
reinstated Peter Gleick as its 
president. Gleick had taken 

a leave of absence after his 
embarrassing admission 

that he had lied to obtain 
internal budget documents 
from the Heartland Institute, 
a libertarian think tank in 
Chicago, Illinois, that aims to 
combat climate science. Ina 
6 June statement, the Pacific 
Institute accepted Gleick’s 
apology, and said that an 
independent investigation 
had confirmed his account 
of events. See go.nature.com/ 
cjnqqa for more. 


} FUNDING 
Nobel thrift 


This year’s Nobel prize 
recipients will split a pot of 
money 20% smaller than that 
shared by last year’s laureates. 
In response to sluggish 
financial markets, which 
have eaten into the capital 
that supports the prize, the 
Nobel Committee said on 

11 June that it would divide 
up 8 million Swedish kroner 
(US$1.1 million) between 


The journal PLoS ONE expects to publish more than 2,000 


research articles this month. 


Research papers published per month 


2007 2008 


1,000 sederaasuonesssnasicaasssassacatentsvonsavss anes uataresenysse 


500 smsedbaenenastenasentasaetarsnsarsneatsrtararsagaianserenentte 


2009 2010 2011 


SEVEN DAYS | THIS WEEK 


20-22 JUNE 

Rio de Janeiro, Brazil, 
hosts this year’s 
biggest environmental 
meeting, the United 
Nations Conference 
on Sustainable 
Development — 

20 years after the same 
city hosted the original 
UN Earth Summit. 
www.earthsummit2012.org 


21 JUNE 

The Royal Society in 
London launches a 
report on how best 

to open up access to 
modern research’s vast 
volumes of data. 
go.nature.com/lyiefm 


the winners of each prize to 
be announced in October; 
last year’s prizes amounted to 
10 million kroner apiece. It 

is the first time the prize has 
been cut in nominal value 
since 1949. See go.nature. 
com/1sml4p for more. 


Telescope funding 
The European Southern 
Observatory (ESO) 
announced on 11 June that 
its member states have given 
approval to start building the 
€1.1-billion (US$1.4-billion) 
European Extremely Large 
Telescope, although not 
everyone has committed their 
budget contributions. At a 
council meeting in Garching, 
Germany, six member states 
voted for the programme 
and four backed it pending 
confirmation of support by 
national ministries; four 
others are yet to agree to 
support the project, which is 
planned for Cerro Armazones 
in Chile’s Atacama Desert. 
However, almost one-third 
of the telescope’s funding 

is scheduled to come from 
Brazil, which has not yet 
ratified its ESO membership. 


> NATURE.COM 
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Health interventions may have reduced child death in African villages, but it isn’t clear by how much. 


GLOBAL HEALTH 


Poverty project 
opens to scrutiny 


Panel set to reshape evaluation of Millennium Villages 
research after partial retraction of health claims. 


BY DECLAN BUTLER 


r Vhe Millennium Villages Project (MVP) 
aims to chart a course out of poverty for 
the most deprived people in Africa. But 

after being forced to partially retract the find- 

ings of a paper that included the claim that its 
interventions had slashed child mortality, the 
project is revamping its oversight to make its 
science more robust and determine whether 
its approach works. 

The MVP, a joint venture between Colum- 
bia University and the non-profit organi- 
zation Millennium Promise, both in New 


York, is simultaneously applying a raft of aid 
interventions — from bed nets for malaria 
control to fertilizers — to help impoverished 
villages to meet the United Nations’ eight Mil- 
lennium Development Goals (MDGs), includ- 
ing reducing hunger, extreme poverty and the 
burden of major diseases, and increasing levels 
of primary education. The first Millennium 
Villages were launched 
in Ethiopia and Kenya 
in 2004 and 2005, and 
the project now oper- 
ates in 10 African coun- 
tries, reaching about 


© NATURE.COM 
To read about how 
Africa is building 
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500,000 people in 80 villages over 14 sites. 

Ina paper published online in The Lancet 
last month’, the project claimed a significant 
milestone. It reported that after three years of 
interventions, child mortality was decreas- 
ing three times faster in the project’s villages 
than in the host nations in general. But the 
analysis was criticized for underestimating 
nationwide improvements in child mortality, 
and overestimating those in the Millennium 
Villages’. The paper's lead author, Paul Pro- 
nyk, later accepted that this claim had put the 
Millennium Village mortality reduction “in 
an excessively positive light”, and retracted 
it. Pronyk then resigned as director of 
Monitoring and Evaluation at Columbia 
University’s Center for Global Health and 
Economic Development. 

The MVP’s founder, Jeffrey Sachs, head of 
the Earth Institute at Columbia and a co-author 
of the partially retracted paper, says that the 
MVP research teams were too autonomous, 
and he regrets not having brought in external 
advisers earlier. “I don’t want such mistakes to 
occur again,” he says. Sachs has now created a 
faculty committee to oversee MVP research 
and increase interactions with outside research- 
ers; he will co-chair the committee, along with 
Cheryl Palm, a scientist at the Earth Institute's 
tropical agriculture programme. 

An external advisory board called the Inter- 
national Scientific Expert Advisory Group will 
oversee how the MVP assesses the impact of 
its work. The panel is still being assembled, 
but will be chaired by Robert Black, head of 
the department of international health at 
Johns Hopkins Bloomberg School of Public 
Health in Baltimore, Maryland. Black says that 
Sachs has given him a remit “to open the study 
to more scrutiny, to be more transparent, to 
be very explicit about what are the expected 
outcomes of the study, and to be sure that eval- 
uation is as well done, as rigorously, as possi- 
ble”. Black also wants to respond to criticisms 
by giving external researchers access to as 
much of the MVP data as he can, as well as to 
detailed information on the project's costs, so 
that outsiders can have greater confidence in 
the project’s results and cost-benefit analyses. 

It is too soon to say what sort of recommen- 
dations the external panel will make on the 
MVP's future research in the study villages. But 
given that the project is already almost seven 
years into its planned ten-year pilot phase, the 
scope for change is limited, says Black. “Itis > 
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> impossible to completely redesign the study, 
though many of us would have designed it dif- 
ferently,” he says. 

Michael Clemens, a migration and develop- 
ment researcher at the Center for Global 
Development, an independent research insti- 
tution in Washington DC, is unimpressed with 
the creation of the advisory panel, and argues 
that the MVP needs rigorous and transpar- 
ent evaluation from completely independent 
groups. Clemens, who co-authored a letter’ to 
The Lancet that led to the partial retraction of 
the MVP’s findings, believes that the paper still 
has problems — in particular, a claim that the 
child mortality rate in Millennium Villages fell 


by one-third more than that in matched com- 
parison villages over the course of the study. 
“The whole study should have been retracted,” 
he says, adding that he has submitted a new 
letter to The Lancet to this effect. 

Black says that whether the MVP ultimately 
provesa success or failure, it will still provide use- 
ful information, so it is important to improve its 
evaluation mechanisms. “I don’t think it should 
be dismissed because it has flaws,” he says. 

The UK government’s Department for Inter- 
national Development last year launched its 
own £3.8-million (US$5.9-million) independ- 
ent evaluation to accompany an £11.5-million 
ten-year grant to create Millennium Villages 


covering 30,000 people in Ghana. “A massive 
effort is needed to raise millions of people in 
Africa out of poverty. Millennium Villages 
represent one approach, and they could make 
a major contribution,’ the department notes. 
“Such an opportunity should not be missed 
merely for want of evidence. But nor should 
scaled-up resources be committed to a model 
without assessing whether it is indeed a cost- 
effective approach.” = 
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Journal offers flat fee for 
‘all you can publish’ 


Latest venture is part of an explosion of ideas for open-access publishing. 


BY RICHARD VAN NOORDEN 


cience-publishing ventures continually 
battle for market space, yet most operate 


on one of only two basic business models. 
Either subscribers pay for access, or authors 
pay for each publication — often thousands of 
dollars — with access being free. But in what 
publishing experts say is a radical experiment, 
an open-access venture called PeerJ, which 
formally announced its launch on 12 June, is 
carving outa fresh niche. It is asking its authors 
for only a one-off fee to secure a lifetime mem- 
bership that will allow them to publish free, 
peer-reviewed research papers. 

Relying on a custom-built, open-source 
platform to streamline its publication process, 
Peer] aims to drive down the costs of research 
publishing, say its founders: Peter Binfield, who 
until recently was publisher of the world’s larg- 
est journal, PLoS ONE, and Jason Hoyt, who 
previously worked at the research-paper-shar- 
ing site Mendeley. Their involvement is a major 
reason for the buzz around Peer]. “I thought 
— wow — if the people I’m hearing about 
are working there — that’s the sign of some- 
thing happening. It makes it less crazy,’ says 
John Wilbanks, an advocate of open access and 
a senior fellow at the Ewing Marion Kauffman 
Foundation in Kansas City, Missouri. 

Peer] is just one of a flurry of experiments, 
encouraged in part by the gathering momen- 
tum of open access, that might shape the 
future of research publishing. “We are seeing 
a Cambrian explosion of experiments with 
new publishing models. It’s going to be an 
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interesting period for the next few years,” says 
Binfield. 

Binfield hopes PeerJ’s growth will resemble 
that of PLoS ONE, which went from publishing 
some 1,000 articles in its first full year (2007) 
to its current 2,000 articles a month. “PLoS 
ONE is publishing so many articles that it is 
stretching the boundaries of what is a journal 
— instead, it’s becoming a large, peer-reviewed 
repository of research articles. We're setting 
ourselves up for exploring that future,” says 
Binfield. But he adds that Peer] will not need 
PLoS ONE’s volume of papers to be viable. 

Whereas PLoS 


ONE charges $1,350 “thought _ 

per paper, Peer] wow—ifthe 
users pay $299 for peopleI’m 
unlimited open- hearing about 
access publications areworking 

and submissions, or there — that’s the 
a smaller fee ($199 signof something 


or $99) for a limited 
number per year. (All 
authors on multi-author papers must be mem- 
bers, although papers with 13 or more authors 
need only 12 paying members.) The journal, 
which received undisclosed start-up support 
from the venture-capital fund O'Reilly Alpha- 
TechVentures in San Francisco, California, will 
be accepting articles from August. 

Despite the low publication cost, Peer]’s 
founders promise that, as with PLoS ONE, 
articles will be peer reviewed for scientific 
validity — but not for importance or impact. 
Other open-access journals have also adopted 
this policy, including Nature Publishing 


happening.” 
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Group’ Scientific Reports. It marks a distinc- 
tion from selective open-access journals such 
as the forthcoming eLife, which plans to pub- 
lish only high-impact work. To avoid running 
out of peer reviewers, every Peer] member is 
required each year to review at least one paper 
or participate in post-publication peer review. 

Untangling user fees from the publica- 
tion of individual articles is a significant 
innovation — but other radical ideas are 
in the pipeline. In high-energy physics, for 
example, a consortium called SCOAP?, 
which includes funding agencies and librar- 
ies, is planning to pay publishers for all the 
costs of publication, so that articles can 
be free to access and authors will not be 
charged directly. On 1 June, the SCOAP* 
initiative said that it had sent out tenders to 
publishers to bid for these contracts, with 
services expected to start in January 2014. 

Other ideas under discussion include jour- 
nals that charge for submissions rather than for 
publications; direct government funding for all 
publications; and research funders setting up 
their own publication infrastructure (much as 
some do with biology databases), says Cam- 
eron Neylon, recently appointed director of 
advocacy at the Public Library of Science in 
San Francisco, which publishes PLoS ONE. 

No one knows what will work. But many say 
that the experiments now under way will help 
to reveal the true costs of sustainably publish- 
ing articles and research data. “Peer] is part 
of the assertion that this can be done cheaper 
— and for that alone it will be interesting to 
watch,” says Neylon. = 


The rise of the right: Hungary’s Jobbik party rallies supporters in Budapest. 


Genome test slammed for 
assessing ‘racial purity’ 


Hungarian far-right politician certified as ‘free of Jewish and Roma’ genes. 


BY ALISON ABBOTT 


fficials in Hungary united this week 
() to condemn ongoing ethnic violence 

and anti-Semitic attacks, including an 
assault on the former Chief Rabbi on 5 June. 
But a cause for further soul-searching has 
emerged: a scientific scandal recalling discred- 
ited notions of racial purity. 

Hungary’s Medical Research Council (ETT), 
which advises the government on health pol- 
icy, has asked public prosecutors to investigate 
a genetic-diagnostic company that certified 
that a member of parliament did not have 
Roma or Jewish heritage. 

The MP in question is a member of the far- 
right Jobbik party, which won 17% of the votes 
in the general election of April 2010. He appar- 
ently requested the certificate from the firm 
Nagy Gén Diagnostic and Research, which 
rents office space at the prestigious Eétvés 
Lorand University in Budapest. The company 
produced the document in September 2010, a 
few weeks before local elections. 

The certificate — with the MP’s name blacked 
out — emerged on the web last month and was 
seized on by the Hungarian media. One of Nagy 
Gén’s financial partners, Tibor Benedek — a 
three-time Olympic water-polo gold medal- 
list and a member of a prominent Jewish fam- 
ily — immediately pulled out of the company. 


The ETT’s secretary, Jozsef Mandl, chair of 
medical chemistry at the Semmelweis Uni- 
versity in Budapest, says that the certificate 
is “professionally wrong, ethically unaccepta- 
ble — and illegal”. The council discussed the 
issue on 7 June and concluded that the genetic 
test violates the 2008 Law on Genetics, which 
allows such testing only for health purposes. 

“The council’s stand is important,” says 
Lydia Gall, an Eastern Europe and Balkans 
researcher at civil-rights group Human Rights 
Watch, who is based in Amsterdam. In Hun- 
gary, “there have been many violent crimes 
against Roma and acts of anti-Semitism in the 
past few years’, she says. Politicians who try to 
use genetic tests to prove they are ‘pure’ Hun- 
garian fan the flames of racial hatred, she adds. 

Nagy Gén scanned 18 positions in the MP’s 
genome for variants that it says are charac- 
teristic of Roma and Jewish ethnic groups; 
its report concludes that Roma and Jewish 
ancestry can be ruled out. The certificate adds: 
“For an interpretation of the test result and for 
genetic consultation relating to the family-tree 
research, please contact us as soon as conveni- 
ent.” 

Nagy Gén did not respond to e-mail and 
telephone requests from Nature for comment. 
Buta statement on its website claims that news- 
papers had reported the story “incompletely” 
and points to the certificate’s recommendation 
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for further consultation. It argues that the com- 
pany “rejects all forms of discrimination, so it 
has no right to judge the purpose for which an 
individual will use his or her test result, and so 
for ethical reasons it could not have refused to 
carry out the test”. 

The certificate first appeared on a right- 
wing website, which described the intention 
behind the gene test as “noble”, although it 
questioned the science. After the news blog 
Petéfi utca republished the certificate on 
14 May, the Hungarian Society of Human 
Genetics issued a statement condemning 
the test. Istvan Rask6, director of the Insti- 
tute of Genetics of the Hungarian Academy 
of Sciences in Szeged, and the society’s vice- 
president, says that it is impossible to deduce 
origins from genetic variations at a few places 
in the genome. “This test is complete nonsense 
and the affair is very harmful to the profession 
of clinical genetics,” he says. 

Nagy Gén’s rental contract with Edtvés 
Lorand University ended this month, says 
Gyorgy Fabri, a university spokesman. “The 
university is not commenting publicly on the 
affair because it is not our business — our 
researchers had no contact with the com- 
pany.’ In a written statement he added that 
the university “fully rejects” the abuse of 
scientific results to promote discrimination 
or hatred. m 
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The United Kingdom Infra-Red Telescope is setting records for scientific publications. 


ASTRONOMY 


Stellar UK scope 
faces closure 


Specialize or die is the mantra for medium-sized instruments. 


BY ERIC HAND 


erched near the 4,200-metre summit 
Pp: Mauna Kea in Hawaii, the United 

Kingdom Infra-Red Telescope (UKIRT) 
enjoys one of the finest sites in all of astronomy. 
It was among the world’s largest telescopes 
when it opened more than 30 years ago, and 
its enviable record of publications shows no 
signs of slowing. Yet on 30 May, the UK Sci- 
ence and Technology Facilities Council (STFC) 
announced that it will close the 3.8-metre 
telescope in 2013, unless a buyer can be found. 

UKIRT’s closure could be the first of many, 
say concerned astronomers. Just as 1- and 
2-metre telescopes began to be mothballed in 
the 1990s to free up funding for a new class of 
8-10-metre instruments, more than a dozen 
4-metre telescopes around the world could 
face a similar fate as astronomers prepare for 
the arrival of 30-metre behemoths (see Nature 
479, 18-19; 2011). 

“Ts this a sign of things to come? It may be,” 
says Gary Davis, director of the Joint Astronomy 
Centre in Hawaii, which operates UKIRT. The 
cash-strapped STFC decided to sacrifice UKIRT 
to preserve its membership commitment to the 
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European Southern Observatory (ESO), which 
is planning a 40-metre giant. “Once again, the 
UK is leading the way,’ Davis adds archly. 

But survival strategies are coming to light. 
Chief among them is that, rather than using 
4-metre telescopes as general-purpose obser- 
vatories, with time being divided among hun- 
dreds of astronomers, they should be dedicated 
to specific problems requiring large collabora- 
tions, long campaigns and custom-built instru- 
ments. “We've become much more like particle 
physics, oriented towards doing specific things 
in big teams,” says Janet Drew, an astronomer at 
the University of Hertfordshire in Hatfield, UK, 
who chaired a committee planning the future 
of Europe's 2-4-metre telescopes. 

In the United States, for example, the Depart- 
ment of Energy is hoping to fund the devel- 
opment of instruments that will allow two 
ageing, 4-metre-class telescopes belonging 
to the National Optical Astronomy Observa- 
tory (NOAO) to engage 


in a quest to under- NATURE.COM 
stand dark energy, the Read more about 
mysterious force that is  30-metre 
speeding up the expan- _telescopesat: 
sion of the Universe. — go.nature.com/tlvath 
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One collaboration, the Dark Energy Survey, 
will install a 570-megapixel camera on the 
4-metre Blanco telescope in Chile. By taking 
pictures of hundreds of millions of galaxies, 
it could reveal tiny distortions in their shape 
that would betray the effect of dark energy. 
Another group, called BigBOSS, plans to 
investigate dark energy by looking for patterns 
in the distribution of 20 million galaxies using 
an instrument installed at the 4-metre Mayall 
telescope in Arizona (see Nature 481, 10-11; 
2012). “The highest-impact science right now 
are these things where you want to put in more 
than 100 nights a year,’ says NOAO director 
David Silva. 

In Europe, one productive niche has been the 
search for planets outside the Solar System. An 
instrument at the ESO’s 3.6-metre telescope at 
La Silla in Chile has already discovered more 
exoplanets than any other ground-based 
search. And in April, the University of Geneva 
in Switzerland installed a near-clone of that 
instrument in the Northern Hemisphere, at 
Italy’s 3.6-metre National Galileo Telescope in 
the Canary Islands. 

A third emerging use for these telescopes 
is to obtain spectra of millions of stars in the 
Milky Way, to try to understand their chem- 
istry and their wanderings through space. 
This would allow astronomers to reconstruct 
a detailed history of the Galaxy’s formation. 
A spectrograph specialized for this purpose 
is already being built for the 3.9-metre Anglo- 
Australian Telescope in Australia, and the ESO 
is considering one for its 3.6-metre New Tech- 
nology Telescope in Chile. 

But Drew says that it is sometimes difficult 
to convince disparate user groups to coalesce 
around a coordinated scientific campaign. And, 
especially in Europe, multinational ownership 
makes it difficult for telescope operators to 
change course. “Structures in Europe are oddly 
complex,’ she says. “They get in the way a bit” 

Moreover, UKIRT itself is proof that a dedi- 
cated speciality campaign does not guarantee 
survival. In 2005, for example, its operators 
invested in a wide-field camera that allowed it 
to survey galaxies across the sky. Patrick Roche, 
an astronomer at the University of Oxford, UK, 
and chair of UKIRT’s oversight board, says 
that in the first four months of this year, more 
than 70 papers have been published contain- 
ing UKIRT data. That puts UKIRT on course 
to being the most scientifically productive 
telescope in the world — outstripping any of 
the 8-metre-class instruments. 

The telescope’s board, which has protested 
against the closure decision, claimed that the 
facility could be kept open a year longer for 
a mere £100,000 (US$155,000). And Davis 
thinks that he might be able to find an institu- 
tion willing to take on the costs, although he 
hasn't quite settled on the best way to advertise 
a 34-year-old telescope with a fine 3.8-metre 
mirror. 

“eBay?” he wonders. “I don’t know.’ = 
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Grant review opens 
up Texas-sized rift 


Big money and big ambitions roil state’s research efforts. 


BY MEREDITH WADMAN 


hen Ronald DePinho took the 
helm at the MD Anderson Can- 
cer Center in Houston, Texas, last 


September, he wasn’t shy about his ambitions. 
He is on a “Moon shot” quest, he said, with 
the aim of making huge gains against five can- 
cers in the next decade, partly by attacking a 
void in cancer drug development. Wasting 
no time, DePinho launched a drug-discovery 
centre, the Institute for Applied Cancer Sci- 
ence (IACS), where his wife, Lynda Chin, was 
appointed chief scientist. The effort gained 
momentum in March, when an IACS research 
team led by Chin won most of a US$20-million 
commercialization grant from the taxpayer- 
funded Cancer Prevention and Research Insti- 
tute of Texas (CPRIT), based in Austin. 

But within weeks the funding coup had 
turned into a media firestorm — one that has 
led to harsh criticism of the CPRIT’s decision- 
making process and exposed tensions within 
the community the institute serves. The affair 
led the CPRIT to announce on 6 June that it 
plans to re-review the award it had made to 
Chin, this time on its scientific merits. The 
original review had considered only its com- 
merical potential. And this week the embattled 
institute has advertised for its first compliance 
officer — whose job it will be to ensure that the 
institute's internal policies and procedures for 
grant-making are followed. 

“Hindsight is twenty-twenty,’ says the 
CPRIT’s executive director William Gimson. 
“We want to hold ourselves to a higher stand- 
ard. So we are willing to come back and say: ‘If 
these questions have been raised, let’s have a 
scientific and a commercial review.” 

The controversy became public with the rev- 
elation on 8 May that the CPRIT’s chief scien- 
tific officer, Nobel laureate Alfred Gilman, was 
resigning. Gilman, it emerged, was incensed by 
the speedy awarding of the IACS grant with- 
out a scientific review and by the simultane- 
ous shelving of seven research awards worth 
$39 million that the institute's scientific review- 
ers had recommended for funding. If the situa- 
tion was not fixed, and quickly, Gilman warned 
in an e-mail to top managers obtained under 
Texas public-records law, “cancer patients will 
lose; the citizens of Texas will be deceived; the 
integrity of science in Texas will be soiled”. 

The state has been the envy of cancer 


scientists since 2007, when its citizens voted 
overwhelmingly to establish the CPRIT and 
authorized $3 billion in bond issues to support 
its efforts. The money — $300 million a year 
for ten years — began flowing in 2010. Since 
then, CPRIT has funded 387 grants worth a 
total of $671 million — most of them to aca- 
demic research institutions (see ‘Mapping the 
millions’). Roughly 17% of the CPRIT’s funds 
have gone to commercialization, helping Texas 
companies to develop cancer diagnostics and 
treatments. Before the IACS award, all grant 
applications were subject to both commercial 
and scientific review. 

Then, in Sep- 
tember, as DePinho 
and Chin arrived in 
Houston from the 
Dana-Farber Cancer 
Institute in Boston, 
Massachusetts, the 
CPRIT unveiled a dif- 
ferent species of com- 
mercial grant. Called 


“The reality is: an ‘incubator’ grant, 
we applied, our its aim was to provide 
proposal was expert technical and 
reviewedandit _ business assistance to 


got funded.” 
Ronald DePinho 


Texas-based transla- 
tional research pro- 
jects and start-ups. 
The IACS team seized on the opportunity with 
a six-and-a-half-page proposal and submitted 
its application on 11 March. By 29 March, an 
annually renewable grant worth $18 million 
per year to the IACS and $2 million to Rice 
University in Houston had been approved by 
the CPRIT’s Commercialization Review Coun- 
cil and signed off by the institute’s governing 
Oversight Committee. The grant named Chin 
as principal investigator. 

On 24 April, after seeing a copy of the grant 
proposal, Gilman wrote to the CPRIT’s Sci- 
entific Review Council, which is chaired by 
Nobel laureate Phillip Sharp at the Massachu- 
setts Institute of Technology in Cambridge, 
complaining about the award. In Gilman’s 
view, the proposal was a “back door” submis- 
sion, amounting to no more than “a vague 
organizational plan” “You would be shocked 
to see it? he wrote. 

After Gilman resigned, Gimson received a 
blistering missive from the eight members of 
the scientific council. The institute's failure > 
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> to include them in reviewing the IACS 
award “calls into question our roles and the 
integrity of the review program in general’, 
they wrote. “This by-pass is inherently unfair 
to every scientist in Texas who participates in 
the CPRIT program... These scientists have 
played by the rules.” 

Both Chin and DePinho have repeatedly 
insisted that they, too, played by the rules. 
DePinho, who asked the CPRIT if it wanted 
to re-review the grant after the controversy 
erupted, says that he is certain that the award 
will stand on its own merits and notes that 
CPRIT officials encouraged the MD Ander- 
son team to try for the award. “The reality 
is: we applied, our proposal was reviewed 
and it got funded,” he says, adding that the 
team’s submission “was done in a way that was 
totally consistent with the CPRIT’s guidelines”. 
MD Anderson provost Raymond DuBois and 
Kenneth Shine, the University of Texas System 
vice-chancellor for health affairs, to whom 
DePinho reports, say that they agree. 

Yet CPRIT correspondence suggests that 
some in authority at the institute worked behind 
the scenes to make sure that the MD Ander- 
son team qualified for the grant. For instance, 
the minutes of an 
internal telephone 
conference sent on 
4 January to Robert 
Ulrich, chairman of 
the commercializa- 
tion council, and Jerry 
Cobbs, CPRIT’s chief 


commercialization 

officer, includes an 

action item for Cobbs ss . . 

to “change some The integrity of 
wording” totheincu- Sclencein Texas 
bator grant request will be soiled.” 
for applications, “to Alfred Gilman 


broaden the scope to 
include incubators that would serve earlier- 
stage programs, such as Lynda Chin's endeavor” 
The compliance office of the University of 
Texas System, of which MD Anderson is a 
part, is investigating why the incubator grant 
application was submitted directly from Chin’s 
team without first being reviewed by the MD 
Anderson provost. The provost normally 
reviews all grant applications going out of the 
institution, looking, in part, for financial con- 
flicts, or appearances of them. “I do have some 
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A handful of institutions have received more than 
US$10 million in research funds from the Cancer 
Prevention and Research Institute of Texas. 
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concern about the precise pathway by which 
the grant was submitted,” says Shine. DePinho 
acknowledges that the application should not 
have been submitted directly to the CPRIT. 

The Houston Chronicle has also raised 
questions about potential conflicts of interest 
among some of the application’s commercial 
reviewers. The CPRIT repudiates this, noting 
that those with connections to DePinho or Rice 
University recused themselves from voting on 
the relevant parts of the award. 

Nonetheless, appearances can be problem- 
atic, says Paul Root Wolpe, director of the 
Center for Ethics at Emory University in 
Atlanta, Georgia. “An absolutely unassailable 
process is desirable,” he opines. “Greater pains 
should have been taken to avoid any appear- 
ance of favouritism? 

DePinho did not improve perceptions when, 
on 18 May, he was a guest on a national cable 
television show that offers investment tips. 
When asked which cancer drug companies 
would make good buys, he touted AVEO 
Oncology of Cambridge, mentioning that he 
had helped to found the company but not that 
he owns 542,000 shares in it. DePinho later 
apologized for his comments in an article in 
the weekly newsletter The Cancer Letter. Shine 
says that a conflict-monitoring committee for 
the University of Texas System is examining 
the incident. “We will be taking a look at that 
episode and what it has to teach us.” 

The controversy has also exposed some of 
the competing agendas within CPRIT. E-mails 
show that DePinho’s ambitious style has irked 
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some scientists who are connected with the 
CPRIT award process and that some of the 
Oversight Committee members were unhappy 
that so many grants were flowing to the Uni- 
versity of Texas Southwestern Medical Center 
at Dallas, where Gilman is based. Gilman 
wrote to CPRIT executives that the most meri- 
torious proposals were being funded, irrespec- 
tive of geography, adding: “We have assembled 
the best from outside the state to evaluate our 
programs. If we do not accept their advice we 
are arrogant and foolish.” 

Gilman's e-mails suggest that he was increas- 
ingly at odds with the Oversight Committee, 
which consists mainly of political appoin- 
tees. In his letter of resignation, he wrote that 
he wanted stay until October, which would, 
among other things, allow him to monitor a 
crucial 26 July meeting of the committee, when 
the seven grants sidelined in March, including 
five to the University of Texas Southwestern 
Medical Center worth a total of nearly $26 mil- 
lion, will again come up for approval. Gimson 
will also seek the committee’s approval to 
review the MD Anderson grant and oth- 
ers like it using a merged committee — half 
commercialization experts and half scientific 
reviewers — and to require from MD Ander- 
son a level of scientific detail not present in its 
original proposal. 

Noting that this is the first of hundreds of 
CPRIT grants to have raised questions, Gim- 
son says: “From my perspective we have had 
a stellar record. My insistence will be that we 
continue to have that stellar record? m= 
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Tabletop X-rays light up 


Compact device promises to open window on chemical reactions in the lab. 


BY KATHERINE BOURZAC 


he pressurized, cylindrical chamber 

| fits in the palm of Margaret Murnane’s 

hand. Yet out of one end of the device 

comes an X-ray beam that packs almost as 

much punch as the light generated by massive 
particle accelerators. 

Murnane and Henry Kapteyn, both physicists 
at JILA in Boulder, Colorado, a joint institute of 
the University of Colorado and the 
US National Institute of Standards 
and Technology, have reported the 
first tabletop source of ultra-short, 
laser-like pulses of low energy, or 
‘soft, X-rays. The light, capable of 
probing the structure and dynamics 
of molecules, was previously avail- 
able only at large, billion-dollar 
national facilities such as syn- 
chrotrons or free-electron lasers, 
where competition for use of the 
equipment is fierce. But the report 
by Murnane, Kapteyn and their 
colleagues, published in the 8 June 
issue of Science (T. Popmintchev et 
al. Science 336, 1287-1291; 2012), 
suggests that the devices might 
soon lie within the grasp ofa uni- 
versity laboratory budget. “For us, 
it’s incredible that we can do this 
at all in a tabletop system,” says 
Murnane. “Three years ago, people 
would have said ‘only large facilities 
can do that.” 

Murnane and Kapteyn, a hus- 
band-and-wife team who also 
head the Boulder-based company 
KMLabs, already sell a similar tab- 
letop source of extreme-ultraviolet 
light. Murnane thinks that a future soft X-ray 
source should cost about US$1 million, and 
hopes that its relatively low cost and small 
size will open up X-ray studies for materials 
scientists, biologists and others. The beams 
generated by the device could, for instance, 
help materials scientists to make better solar 
materials by tracking the paths of electrons 
through solar cells, and might allow chemists 
to trace the ultrafast dynamics of photo- 
synthesis and catalysis. “This is something 
people have been waiting for for a very long 
time,” says Oleg Shpyrko, a physicist at the Uni- 
versity of California, San Diego. Shpyrko often 
waits months to get his experiments accepted 
at the Advanced Photon Source, a synchrotron 
at Argonne National Laboratory in Illinois, 
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and must then fly his students halfway across 
the country to do the work. 

The tabletop sources rely on a technique 
called high-harmonic generation, in which 
laser light is passed through a medium that 
converts it to light of shorter wavelengths and 
higher frequencies. Shine a ruby laser into 
a quartz crystal, for example, and a beam of 
ultraviolet light comes out — albeit dimmer, 
but still focused like a laser beam. 


Chemical bonds can be probed with fast, laser-like X-ray pulses (illustrated). 


Murnane and Kapteyn have pushed high- 
harmonic generation to its limits, with a sys- 
tem that uses an infrared laser as the source 
and pressurized helium gas as the medium. 
The laser creates a strong electric field, which 
draws electrons away from the helium atoms, 
allowing the electrons to absorb energy from 
the electric field. When they slam back in to 
the helium atoms, they release that absorbed 
energy as shorter-wavelength photons — but 
only about one photon comes out for every 
5,000 infrared photons that are put in. 

The result is light at wavelengths almost as 
short as those delivered by synchrotrons. By 
increasing the pressure in the gas — something 
theorists thought might defocus the light beam 
— Murnane and Kapteyn can produce light 
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with a wavelength of 0.8 nanometres. Around 
these wavelengths, many of the chemical ele- 
ments used in magnets and superconductors 
absorb a characteristic band of light. Kapteyn 
says that this could be used by chemists to dis- 
cern the spin states of nickel atoms that make 
up the bits of information in magnetic com- 
puter hard drives, for example. 

The tabletop systems have already surpassed 
the larger light-source facilities in terms of 
laser pulse speed. Murnane and 
Kapteyn’s device produces very 
fast pulses, as short as 2.5 atto- 
seconds (10°'* seconds) — faster 
than the picosecond (10"”’) pulses 
of synchrotrons and the femto- 
second (107!) pulses of free- 
electron lasers. That timescale is 
even quicker than the making and 
breaking of chemical bonds. “At 
this timescale, we can start using 
these light sources to address ques- 
tions where we have no idea what 
the answers will be,” says Ferenc 
Krausz, a physicist at the Max 
Planck Institute for Quantum 
Optics in Garching, Germany, 
and co-founder of Femtolasers, a 
company that sells ultrafast light 
sources. 

Light produced by these 
sources is much less intense than 
that at the big national facilities, 
however. That’s one reason why 
physicist Emma Springate, who 
runs Artemis, part of the Central 
Laser Facility at the Science and 
Technology Facilities Council's 
Rutherford Appleton Laboratory 
near Didcot, UK, wants access to 
both technologies. Artemis already has one of 
KMLabs ultrafast extreme-ultraviolet sources, 
which it combines with access to a synchro- 
tron source. “The synchrotron gives you a 
static, really clear high-resolution picture, and 
the ultrafast source gives you a slightly fuzzy 
movie,’ Springate says. 

Tabletop X-ray sources could still be 
several years away, Murnane says, but she 
hopes that they will one day be as common in 
labs as electron microscopes are. Shpyrko, for 
one, is looking forward to that day. Waiting to 
get an experiment accepted at a national facil- 
ity, he explains, can feel like waiting for a space 
shuttle to launch. “If you have one of these in 
your own lab, you can dream up an experi- 
ment and try it tomorrow,’ he says. m 
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TWO PHYSICISTS 
SAY THEY HAVE 
FORCED HYDROGEN 
TO BECOME AN 
EXOTIC METAL 
THOUGHT TO 
EXIST ONLY IN THE 
HEARTS OF GIANT 
PLANETS. NOW 
THEY MUST FACE 
THEIR CRITICS. 


BY IVAN AMATO 


ikhail Eremets and Ivan Troyan 

work only with the very best, gem- 

quality diamonds. Nothing else can 

handle the stresses involved — not 
when the two physicists make a habit of forcing 
the diamonds tips together until the pressure 
between them reaches levels normally found 
at the centre of the Earth. 

Eremets and Troyan, both at the Max Planck 
Institute for Chemistry in Mainz, Germany, are 
not alone in this strange pursuit. Their appara- 
tus, known as a diamond anvil cell, is a stand- 
ard fixture in high-pressure research labs. But 
they have provoked widespread agitation in 
their research community by claiming to have 
crushed hydrogen in their cells until it gave up 
being a diatomic gas, and instead became a 
shiny, presumably monatomic, solid that con- 
ducted electricity like a metal. 

The arguments over this alleged discov- 
ery — one of the most sought-after results in 
high-pressure research — have been raging 
ever since Eremets and Troyan published their 
results last November’. “We could just read 
their paper and say it was wrong,” declares 
condensed-matter physicist William Nellis, 
an associate of Harvard University’s physics 
department in Cambridge, Massachusetts, 
and one of the team’s most vocal challengers”. 

But others, such as Raymond Jeanloz, a 
planetary scientist and high-pressure-materi- 
als researcher at the University of California, 
Berkeley, are keeping an open mind. Whether 
Eremets and Troyan are eventually proved right 
or wrong about metallic hydrogen, Jeanloz 
says, “what I feel is beautiful about their work is 
that they did a bazillion different experiments 
at these extreme conditions. They lit a fire 


under a couple of research groups.” 

On 26 June, the main figures in this 
controvery will gather in Biddeford, Maine, 
during a Gordon Research Conference on 
high-pressure science. The result could be a 
meeting of minds — or a display of fireworks. 
“This is a very intense field,” says Jeanloz. 
High-pressure scientists have been trying — 
and failing — to make metallic hydrogen ever 
since theorists first predicted its existence in 
1935 (ref. 3). Anyone claiming success can 
expect an all-out critique from rival groups — 
especially considering what is at stake. 

Not only would making metallic hydrogen in 
the laboratory allow researchers to do planetary 
science at the bench — gas-giant planets such as 
Jupiter, or the even larger ones being discovered 
around distant stars, are thought to have huge 
amounts of the stuff in their interiors — but it 
could point the way towards an entirely new 
world of high-pressure phenomena. 

“Hydrogen is the simplest atom, the simplest 
molecule and perhaps the most complicated 
elemental solid? says Arthur Ruoff, a high-pres- 
sure physicist at Cornell University in Ithaca, 
New York. In 1968, Cornell physicist Neil Ash- 
croft predicted that solid metallic hydrogen 
might be a superconductor’. In 2004, calcula- 
tions’ by Ashcroft and others suggested that, 
under certain combinations of pressure and 
temperature, hydrogen atoms would rearrange 
themselves into a new kind of quantum liquid 
with attributes of both superconductors, which 
conduct electricity without resistance, and 
superfluids, which flow without resistance. 


BRIGHT FUTURE 

Such exotic behaviours become particularly 
interesting if some of these phases of hydrogen 
turn out to be metastable. This would mean 
that the phases could retain their high-pressure 
forms for an indefinite period once external 
forces are removed, much as diamonds formed 
by high temperatures and pressures deep inside 
Earth remain diamonds even after they reach 
the surface, instead of immediately reverting to 
carbon’s more stable form, graphite. Nellis and 
others have imagined a host of applications for 
metastable metallic hydrogen, ranging from 
super-lightweight structural materials that 
would allow entire cities floating on the sea to 
be built, to rocket fuel that packs nearly four 
times as much propellant power per kilogram 
as the liquid hydrogen used in the most power- 
ful rockets today’. 

First, however, comes the reality check. 
Diamond anvil cells can use only vanishingly 
small sample sizes. (The volume of Eremets and 
Troyan’s hydrogen samples was about 160 cubic 
micrometres — somewhat smaller than an 
average human cell.) High-pressure experi- 
ments are fraught with the potential for error. 
And even the most experienced researchers run 
the risk of fooling themselves. The question is 
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whether Eremets and Troyan are among them. 

In their experiments’, the Mainz physicists 
started with pairs of brilliant-cut diamonds. 
They trimmed each of the diamonds’ points 
into a flat surface, or culet, 20-30 micrometres 
across, then aligned the diamonds with those 
truncated tips almost touching, on either side 
of a piece of metal foil pierced with a culet- 
sized hole that would enclose a minuscule 
experimental chamber. 


THROUGH A GEM DARKLY 

For each experiment, Eremets and Troyan 
loaded hydrogen gas into the hole, and started 
tightening a set of screws that forced the dia- 
monds closer together. As the culets bit into the 
foil at the rim of the hole, the metal deformed 
around them to form a seal trapping the hydro- 
gen. Andas the force exerted by the screws was 
focused down onto the culets, the pressure in 
the chamber began to skyrocket. 

This is where high-pressure experiments 
often go awry. Although the anvil’s diamond 
jaws are made of one of the hardest materials 
known, superpressured hydrogen routinely 
infiltrates them, making them brittle and 
causing the gems to crack like cheap plastic 
cups. This also spoils the gemstones’ trans- 
parency, stopping the physicists from seeing 
what is happening in the chamber. Eremets 
and Troyan addressed that problem by apply- 
ing semi-transparent coatings to protect their 
diamonds from the infiltration of hydrogen. 
But even so, they went through about 100 pairs 
of diamonds as they worked out the bugs of 
their apparatus and ran trials and controls. 

When the pressure inside their anvils 
reached 200 billion pascals (200 GPa), or about 
2 million times Earth's atmospheric pressure at 
sea level, the two Mainz physicists saw spec- 
troscopic signs that the diatomic hydrogen 
molecules had begun to interact in a way that 
signalled that they were becoming a solid. 

At 220 GPa, the sample chamber became 
dark, another sign that hydrogen had appar- 
ently assumed a condensed phase. Eremets and 
Troyan found that laser pulses fired through 
the transparent diamond triggered small flows 
of electrons that they could detect by means 
of copper and gold electrodes deposited on the 
culet surfaces — behaviour characteristic of 
semiconducting materials, in which electrons 
need a small energy kick to flick them out of 
orbit around atoms and into a conductive flow. 

At 240 GPa, Eremets and Troyan recorded 
small currents even without the laser shots, 
an indication that room-temperature thermal 
vibrations alone could nudge the electrons into 
a conductive mood. 

And at about 270 GPa, the researchers saw 
their samples’ electrical resistance suddenly 
drop by several orders of magnitude, just 
as their spectroscopy was showing that the 
hydrogen’s molecular vibrations were slowing 


14 JUNE 2012 | VOL 486 | NATURE | 175 


Mikhail Eremets (left) and Ivan Troyan stand by the apparatus they used to create what they believe to be metallic hydrogen. 


down and morphing in ways suggestive of 
a phase transition — and the samples were 
becoming shiny like a metal. 

“It is hard to sit on data like these,” says 
Eremets. He and Troyan are convinced that 
their findings are at least consistent with hav- 
ing made metallic hydrogen. 

But others are not so sure. Sceptics note that 
pitfalls are everywhere in experiments such as 
these. If an experimenter peers through the 


diamonds and sees the sample begin to darken, 
for example, that may mean that it is becom- 
ing a solid — or that the hydrogen is reacting 
with the foil or other impurities to form metal 
hydrides. If odd electrical signals start coming 
in from the culet electrodes, they may indicate 
the formation of a new phase of hydrogen — or 
that the diamond jaws are deforming, causing 
the electrodes to short out or emit spurious sig- 
nals. Eremets says that he and Troyan are well 
aware of such pitfalls, and did their best to avoid 
them. Nonetheless, says Ruoff, “I don’t know 
anybody who thinks their claim is valid”. 

One fact that troubles Ruoff and other crit- 
ics is that the resistance rises as the sample's 
temperature drops. This is contrary to nor- 
mal metallic behaviour, in which the resist- 
ance goes down as the temperature does. But 
when reviewers of the paper raised that issue, 
says Eremets, he successfully argued that the 
upward trend is consistent with that seen in a 
disordered metal, in which the atomic struc- 
ture is more like that ofa glass than a crystal. 

Russell Hemley, a high-pressure-materi- 
als researcher at the Carnegie Institution of 
Washington in Washington DC, is sceptical of 
Eremets and Troyanss claims for another reason: 
if the hydrogen really becomes metallic, then 
the shininess that the Mainz team reports at 
optical wavelengths ought to be accompanied 
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by equally strong reflectivity at infrared wave- 
lengths. Eremets and Troyan were not able to 
get good infrared data on their samples. But 
Hemley says that in his own experiments with 
hydrogen at similar pressures, “we see it trans- 
mitting” in the infrared. Moreover, in a paper 
published in April’, Hemley and his colleagues 
report signs that hydrogen squeezed up to 
360 GPa holds on to its diatomic character and 
fails to morph into a monatomic metal. Adding 


to the complexity of the situation, some theo- 
rists suggested’ that, if squeezed enough, even 
intact hydrogen molecules might develop a new 
and crowded bonding pattern that could have 
metallic properties. 

Eremets and Troyan are standing their 
ground. “Our measurements are not perfect,” 
Eremets concedes, but he insists that “metal- 
lic hydrogen remains a viable interpretation”. 
He points to a paper published in March’ by 
Eugene Gregoryanz, a physicist at the Uni- 
versity of Edinburgh, UK, and his colleagues. 
“They observe a new phase, the same Raman 
spectra and a darkening of the sample” at 
around 220 GPa, says Eremets. Clearly, he 
argues, something is happening at that point 
— although Gregoryanz and his colleagues 
attributed the changes to formation, not of 
metallic hydrogen, but ofa previously unseen, 
graphene-like phase of molecular hydrogen. 

In an effort to clarify the situation, Eremets 
and Troyan travelled to Villigen in Switzerland 
in late April, to gather infrared spectra using the 
synchrotron light source at the Paul Scherrer 
Institute there. The data could reveal vibrations 
in the lattice structures ofhigh-pressure hydro- 
gen samples that would help theoreticians to 
identify specific structures of any solid phases 
that might be present, Eremets says. Such struc- 
tural information would not, by itself, prove 
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that the hydrogen had become metallic. But 
Eremets and Troyan also plan to attempt more 
thorough and extensive electrical measure- 
ments, which could prove conclusive. 

Looking back, Eremets admits that he and 
Troyan probably should have been more cir- 
cumspect in their paper — conveying the 
message that they might have made metallic 
hydrogen, rather than claiming more strongly 
that they had done so, as they did in their 
abstract and their concluding paragraphs. 

Meanwhile, the race for metallic hydrogen 
continues, as other high-pressure research 
groups pursue their own experiments. Jeanloz 
points out that many members of this com- 
munity, himself included, have mingled over 
the years as collaborators, postdocs, supervi- 
sors and mentors, only to go on to become each 
other’s staunchest critics and rivals, unlikely 
to let the slightest chink in an argument 
go unchallenged. What emerges from this 
dynamic, say Jeanloz and others, is a creative 
tension that could eventually force those who 
claim to have produced metallic hydrogen to 
do enough experiments, with enough controls, 
to compile enough lines of evidence to con- 
vince even the pickiest of critics. Until then, 
the trophy seems to be up for grabs. m 


CARSTEN COSTARD 


Ivan Amato is a freelance writer based in 
Silver Spring, Maryland. 
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Adrian Owen has found a way to use brain scans to communicate with people previously 
written off as unreachable. Now, he is fighting to take his methods to the clinic. 


er 


BY DAVID CYRANOSKI 


patient was only 24 years old when his life was devastated by a car 

accident. Alive but unresponsive, he had been languishing in what 
neurologists refer to as a vegetative state for five years, when Owen, a neuro- 
scientist then at the University of Cambridge, UK, and his colleagues at 
the University of Liége in Belgium, put him into a functional magnetic 
resonance imaging (fMRI) machine and started asking him questions. 

Incredibly, he provided answers. A change in blood flow to certain 
parts of the man’s injured brain convinced Owen that patient 23 was 
conscious and able to communicate. It was the first time that anyone had 
exchanged information with someone in a vegetative state. 

Patients in these states have emerged from a coma and seem awake. 
Some parts of their brains function, and they may be able to grind their 
teeth, grimace or make random eye movements. They also have sleep- 
wake cycles. But they show no awareness of their surroundings, and 
doctors have assumed that the parts of the brain needed for cognition, 
perception, memory and intention are fundamentally damaged. They 
are usually written off as lost. 

Owen's discovery’, reported in 2010, caused a media furore. Medical 
ethicist Joseph Fins and neurologist Nicholas Schiff, both at Weill Cor- 
nell Medical College in New York, called it a “potential game changer 
for clinical practice”. The University of Western Ontario in London, 
Canada, soon lured Owen away from Cambridge with Can$20 million 
(US$19.5 million) in funding to make the techniques more reliable, 
cheaper, more accurate and more portable — all of which Owen consid- 
ers essential if he is to help some of the hundreds of thousands of people 
worldwide in vegetative states. “It’s hard to open up a channel of commu- 
nication with a patient and then not be able to follow up immediately with 
a tool for them and their families to be able to do this routinely,’ he says. 

Many researchers disagree with Owen's contention that these 
individuals are conscious. But Owen takes a practical approach to apply- 
ing the technology, hoping that it will identify patients who might respond 
to rehabilitation, direct the dosing of analgesics and even explore some 
patients feelings and desires. “Eventually we will be able to provide some- 
thing that will be beneficial to patients and their families, he says. 

Still, he shies away from asking patients the toughest question of all — 
whether they wish life support to be ended — saying that it is too early 
to think about such applications. “The consequences of asking are very 
complicated, and we need to be absolutely sure that we know what to do 
with the answers before we go down this road,” he warns. 


A drian Owen still gets animated when he talks about patient 23. The 


LOST AND FOUND 

With short, reddish hair and beard, Owen is a polished speaker who is 
not afraid of publicity. His home page is a billboard of links to his televi- 
sion and radio appearances. He lectures to scientific and lay audiences 
with confidence and a touch of defensiveness. 

Owen traces the roots of his experiments to the late 1990s, when he 
was asked to write a review of clinical applications for technologies such 
as fMRI. He says that he had a “weird crisis of confidence”. Neuroimag- 
ing had confirmed a lot of what was known from brain mapping studies, 
he says, but it was not doing anything new. “We would just tweak a psych 
test and see what happens,’ says Owen. As for real clinical applications: 
“T realized there weren't any. We all realized that” 

Owen wanted to find one. He and his colleagues got their chance in 
1997, with a 26-year-old patient named Kate Bainbridge. A viral infection 


had put her in a coma — a condition that generally persists for two to four 
weeks, after which patients die, recover fully or, in rare cases, slip into 
a vegetative or a minimally conscious state — a more recently defined 
category characterized by intermittent hints of conscious activity. 

Months after her infection cleared, Bainbridge was diagnosed as 
being in a vegetative state. Owen had been using positron-emission 
tomography in healthy people to show that a part of the brain called the 
fusiform face area (FFA) is activated when people see a familiar face. 
When the team showed Bainbridge familiar faces and scanned her brain, 
“it lit up like a Christmas tree, especially the FFA’, says Owen. “That was 
the beginning of everything.” Bainbridge was found to have significant 
brain function and responded well to rehabilitation’. In 2010, still ina 
wheelchair but otherwise active, she wrote to thank Owen for the brain 
scan. “It scares me to think of what might have happened to me if I had 
not had mine,’ she wrote. “It was like magic, it found me” 

Owen moved from visual to auditory tests — “up the cognition ladder, 
from basic sound perception, to speech perception and then to speech 
comprehension” For example, he presented people in a vegetative state 
with phrases containing words that sound the same but have two mean- 
ings, such as “The dates and pears are in the bowl”. The ambiguity forces 
the brain to work harder and shows up in characteristic {MRI patterns 
in healthy people — if, that is, they are comprehending the words. One 
of Owen's patients, a 30-year-old man who had been incapacitated by 
a stroke, showed the same pattern’. But not everyone was convinced 
that these signs pointed to comprehension. “Every time I would go toa 
neurologist or anaesthesiologist and say, ‘he’s perceiving speech; theyd 
ask ‘but is he conscious?’.” Owen realized that he needed a different 
experiment to persuade the sceptics. 


ANYONE FOR TENNIS? 

It was June 2006. Wimbledon was on, and in a headline-stealing study, 
Owen took fMRI scans of a 23-year-old woman in a vegetative state 
while he asked her to imagine playing tennis and walking through the 
rooms of her house. When healthy, conscious adults imagine playing 
tennis, they consistently show activation in a region of the motor cortex 
called the supplementary motor area, and when they think about navi- 
gating through a house, they generate activity in the parahippocampal 
gyrus, right in the centre of the brain. The woman, who had been unre- 
sponsive for five months after a traffic accident, had strikingly similar 
brain activation patterns to healthy volunteers who were imagining 
these activities, proving, in Owen’s mind, that she was conscious. The 
result, published in a one-page article in Science’, evoked wonder and 
disbelief. “I got two types of e-mail. People either said ‘this is great’ or 
‘how could you possibly say this woman is conscious?’;’ Owen says. 

Other researchers contended that the response was not a sign of 
consciousness, but something involuntary, like a knee-jerk reflex. Daniel 
Greenberg, a psychologist at the University of California, Los Angeles, 
suggested in a letter to Science that “the brain activity was unconsciously 
triggered by the last word of the instructions, which always referred to 
the item to be imagined”. 

But Owen went on to bolster his case. Working with neurologist 
and neuroscientist Steven Laureys from the University of Li¢ge, Owen 
showed that of 54 patients in a vegetative or minimally conscious state, 
five responded in the same way as the first woman’. Four of them were 
in a vegetative state. After refining their methods, the researchers asked 
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patient 23 to use that capability to answer yes-or-no questions: imagine 
playing tennis for yes, navigating the house for no. They then asked about 
things that the technicians scoring the brain scans couldn't possibly know. 

Is your father’s name Thomas? No. Is your father’s name Alexander? 
Yes. Do you have any brothers? Yes. Do you have any sisters? No. The 
experiment is no easy feat for the patient. Owen's protocol demands 
patients maintain focus for 30 seconds then rest for 30 seconds, with 
lots of repetition. 

In front of a computer screen showing the {MRI data, Owen traces a 
blue line indicating activity in the supplementary motor area — a ‘yes’ — 
as it rises during the ‘answer’ period. It dives during the rest periods. A 
red line — indicating activity in the parahippocampal gyrus — represents 
the ‘no. The lines are sharp and clear, and Owen, who hasa taste for puns, 
calls the implication “a no-brainer”. “You don't need to be a functional- 
imaging expert to appreciate what this person is 
telling you,’ he says. The patient answered five of 
six questions correctly’. There was no discernible 
signal for the sixth. 

Russell Poldrack, a neuroimaging expert at the 
University of Texas at Austin, calls Owen’s meth- 
ods ingenious. “When I want to give someone examples in which {MRI 
has told us something we really didn’t know before, I use these,” he says. 

But Parashkev Nachev, a clinical neuroscientist at Imperial College 
London, criticizes the work for “assuming that consciousness is a binary 
phenomenon” Many patients, such as those having certain types of epi- 
leptic seizures, exhibit limited responsiveness without being conscious. 
Nachev says that more data are needed to indicate where in the con- 
tinuum of cognitive abilities people in vegetative states fall. 

Owen agrees that consciousness is not an “on-or-off thing” He sees 
it as an “emergent property” of many “modules” of the brain working 
together. Enough of these modules are at work in his exercise, he says, for 
responsive patients to qualify as being conscious. A person needs long- 
term memory to know what tennis is, short-term memory to remember 
the question or command and intention to give an answer. Ultimately, 
Owen is not concerned with pinpointing a threshold of consciousness or 
with providing a comprehensive definition for it. He takes a “know it if 
you see it” approach. Responding to commands and questions — com- 
munication — is an undeniably conscious activity, in his view. “In the 
end if they say they have no reason to believe the patient is conscious, I 


say ‘fine, but I have no reason to believe you are either,” he says. 


TO THE CLINIC 

Currently, there are tens of thousands of people in a vegetative state in 
the United States alone. Owen reckons that up to 20% of them are capa- 
ble of communicating; they just don’t have a way to do so. “What we're 
seeing here is a population of totally locked-in patients,’ Owen says. 

Owen now wants to put his technique into the hands of clinicians and 
family members. So far, the technology has done little. The first woman 
in the tennis study died last year, and patient 23, for logistic and financial 
reasons, was assessed only once. Even if a person in a vegetative state is 
‘found; there is no guarantee that he or she will later be able to return a 
normal life. Owen nevertheless insists that “clarifying” a patient's state 
of consciousness helps families to deal with the tragedy. “They want to 
know what the diagnosis really is so that they can move on and deal with 
that. Doubt and uncertainty are always bad things.” 

Two years ago, Owen was awarded a 7-year Can$10-million Canada 
Excellence Research Chair and another $10 million from the University 
of Western Ontario. He is pressing forward with the help of three new 
faculty members and a troop of postdocs and graduate students. 

An early goal of the programme was to repeat the fMRI findings using 
an electroencephalogram (EEG)’. An EEG lacks fMRI’s precision, and 
it cannot look as deeply into the brain, so the regions active in the ten- 
nis study were “off the menu’, says Owen. But other tasks — imagining 
wiggling a finger or toe — produce signals that, through repetition, 
become clear. An EEG is also cheap, relatively portable and fast (with 
milliseconds of lag compared with 8 seconds for fMRI), meaning that 
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the research team can ask up to 200 questions in 30 minutes. “From a 
single trial youre not going to say, ‘that person is saying yes, but if they 
get 175 of 190 right when tested, it’s pretty clear” 

Now, using an EEG, Owen is planning to study 25 people in a vegetative 
state every year. He will have the help of a new ‘EEGeep, a jeep equipped 
with experimental equipment that will allow the researchers to travel 
around to test patients who cannot be transported to Western Ontario. 

One goal is to identify other brain systems, such as smell or taste, that 
might be intact and usable for communication. Imagining sucking a 
lemon, for example, can produce a pH-level change in the mouth anda 
recognizable brain signal®. Owen has shown that registering jokes pro- 
vokes a characteristic response in healthy people’ and plans to try it on 
patients in a vegetative state. He hopes that he can use these tests to find 
some level of responsiveness in patients who cannot produce the tennis 
and navigation patterns of activity because of their 
level of brain damage. 

The studies will also explore whether these 
patients have the capacity for greater intellectual 
depth. Owen thinks that some people in a vegeta- 
tive state will eventually be able to express hopes 
and desires, perhaps like French magazine editor Jean-Dominique 
Bauby, who dictated his memoirs by repeatedly winking one eye. “I 
don't see a reason why they could not have a similar richness of thought, 
although undoubtedly some will not,’ Owen says. 

His techniques could also radically change treatment. Owen is already 
asking patients whether they feel pain. The answers will be useful in dos- 
ing pain killers, and similar tests could even be used in intensive-care 
units to guide rehabilitation resources, says Loretta Norton, a graduate 
student who is undertaking a study for this purpose. But she recognizes 
that this will be controversial. 


DECISION TIME 

Owen's methods raise more difficult dilemmas. One is whether they 
should influence a family’s or clinician’s decision to end a life. If a patient 
answers questions and demonstrates some form of consciousness, he 
or she moves from the ‘possibly allowed to die’ category to the ‘not gen- 
erally allowed to die’ category, says Owens. Nachev says that claiming 
consciousness for these patients puts families in an awkward position. 
Some will be given hope and solace that their relative is still ‘in there 
somewhere’ Others will be burdened by the prospect of keeping them 
alive on the basis of what might be ambiguous signs of communication. 

Even more ethically fraught is whether the question should be put 
to the patients themselves. Fins and Schiff question whether patients 
would ever be able to show that they can understand the complexities 
of that question in the way that is normally demanded of, for example, 
patients giving informed consent. 

Owen hopes one day to ask patients that most difficult of questions, 
but says that new ethical and legal frameworks will be needed. And it will 
be many years, he says, “before one could be sure that the patient retained 
the necessary cognitive and emotional capacity to make such a complex 
decision”. So far, he has stayed away from the issue. “It might bea little 
reassuring if the answer was ‘no but you can't presuppose that.” A ‘yes’ 
would be upsetting, confusing and controversial. 

For now, Owen is hoping to use the technology to find other respond- 
ers like Kate Bainbridge — who Owen now describes as a “motivational 


force”. “Otherwise,” he says, “what's the point?” m 


David Cyranoski is Nature’ Asia-Pacific correspondent. 
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The Milky Way as a combined image of near-infrared (yellow), infrared (red) and X-ray (blue and violet) data collected by NASA’s ‘Great Observatories’. 


A midlife crisis for 
X-ray astronomy 


As the field celebrates its 50th birthday, Martin Elvis asks 
how to Keep this unique window into the Universe open. 


n 18 June 1962, an 8-metre rocket 
() carried three small X-ray detectors 

to the edge of space. They spent 
just under 6 minutes above the altitude of 
80 kilometres, high enough for kiloelec- 
tronvolt-energy X-rays from space to reach 
them through the thinned atmosphere. The 
result of this brief flight by physicist Riccardo 
Giacconi and his colleagues revolutionized 
astronomers’ view of what the Universe 
contains. 

For 50 years, X-ray astronomy has 
burgeoned. The field has grown a billion 
times more sensitive — a feat that took opti- 
cal astronomy 400 years to achieve — and 


has opened up such exotica as black holes 
and dark matter to detailed investigation. It 
has provided a unique window into ‘extreme’ 
places of the Universe, where gas can be 
1,000-10,000 times hotter than the surface 
of the Sun. The question now is whether that 
progress will continue. 

In the 1960s, X-rays were known to be 
produced by the Sun — that had been discov- 
ered in 1948. But the Sun’s weak X-ray output 
made it seem futile to try to detect any other 
star this way. Giacconi justified his flight by 
telling the US Air Force that he was looking 
for X-rays from the Moon, but he hoped to 
find more. Fortunately, the Universe delivered 
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surprises: a strong source of X-rays in the 
constellation Scorpius and a bright back- 
ground of X-rays from all over the sky. The 
researchers were lucky. Had they looked when 
Sco X-1, as this first source was dubbed, was 
below the horizon, they would have detected 
only the bright background noise. Interest 
in cosmic X-rays might have withered. 
Many teams rushed to follow up in 
the subsequent years. X-ray astronomy 
papers often dominated the pages of 
Astrophysical Journal Letters. One important 
discovery was that the brightness of these 
cosmic X-ray sources often changed within 
seconds. Because nothing can change its 
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output faster than light can travel across it, 
this meant that the objects must be tiny — 
just light seconds across. Creating so much 
energy from such a small volume required 
some not-yet recognized energy source. This 
turned out to be gravity: the accretion of gas 
falling down the deep gravitational well ofa 
compact star converts potential energy into 
millions of degrees of heat. 

The next big step was the development of 
orbiting X-ray instruments, which allowed 
for exposure times of days rather than min- 
utes. NASA’s 1970 Uhuru mission, led by 
Giacconi, was the first in a small fleet of these. 
Uhuru and its successors detected hundreds of 
X-ray sources 1,000-10,000 times fainter than 
Sco X-1. 

But it was the cosmic X-ray background 
that set the real programme for X-ray astro- 
nomy for the next 40 years. Was it caused by 
hot gas pervading intergalactic space, or by 
millions of faint and distant sources blending 
together? Early detectors could not answer 
this question, because they had to collect 
X-rays from a large chunk of the sky to detect 
a signal; discrete sources that might make up 
the background produced no more than one 
X-ray per square centimetre of detector each 
day. To see with greater precision, research- 
ers needed more sensitive telescopes, which 
they built from nested cylindrical mirrors. 

The first such imaging X-ray telescope 
— NASA’ 1978 Einstein Observatory, led 
by Giacconi — detected radiation 100-fold 
fainter than any of its predecessors. It found 
that 20% of the cosmic X-ray background 
came from active galactic nuclei, later under- 
stood to host supermassive black holes in 
their centres. 


LEAPS AND BOUNDS 
By 1987 — the twenty-fifth anniversary of 
the discovery of Sco X-1 — X-ray astronomy 
had become 10 million times more sensitive. 
The next generation of X-ray telescopes 
came in the early 1990s. Notable among 
them was Germany’s ROSAT, which sur- 
veyed the whole sky, cataloguing 100,000 
X-ray sources and, by going tenfold fainter 
still than Einstein, resolving some 60% of 
the cosmic X-ray background that had an 
energy lower than 2 kiloelectronvolts (keV). 

The current era was ushered in by the 
launch of two large X-ray observatories in 
1999: NASA's Chandra and the European 
Space Agency’s XMM-Newton. XMM has a 
large collecting area, but Chandra — with 
an angular resolution ten times better than 
anything previous — was revolutionary. 
Giacconi received the Nobel prize in 2002, 
perhaps stimulated by the outpouring of 
results from Chandra, which he had been 
instrumental in getting started. 

Chandra has resolved the cosmic X-ray 
background at all energies up to 7 keV. 
More than 90% of the background is due to 
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the summed emissions of millions of active 
galaxies (most of the rest comes from fainter, 
star-forming galaxies). Chandra and XMM 
have also delimited the extent of dark matter’s 
interaction with itself, determined properties 
of dark energy and showed that as much as 
one-fourth of the energy of a supernova goes 
into accelerating protons, solving the mystery 

of the origin of high-energy cosmic rays. 
Fifty years on, the founding mysteries of 
X-ray astronomy have been solved. But our 
understanding of more complex questions 
is still primitive. In my own sub-field alone, 
we don't yet know why active galaxies emit 
X-rays, where the massive black holes at their 
centres come from or how a massive black 
hole accelerates 


“Prospectsfor —_matter in bulk to 
asetof ‘Greater speeds more than 
Observatories’ —_ 90% that of light in 
that span the a tight beam that 
electromagnetic can extend over 
spectrum look millions of light 
bleak.” years. 


At the same 
time, X-ray astronomy has the power to test 
two bases of twentieth-century physics — 
general relativity and quantum chromody- 
namics (which describes interactions of the 
‘strong force’ between quarks and gluons) 
— in the extreme conditions around black 
holes and neutron stars. 

But for astronomers to answer these ques- 
tions, X-ray astronomy must become even 
more sensitive, by collecting more photons 
with larger mirrors while maintaining the 
fine resolution seen with Chandra. The US 
2010 decadal survey for astronomy ranked 
a larger-area X-ray mission fourth among 
major space missions, but that is low enough, 
given the restricted US science budget, to put 
anew X-ray observatory on the back burner 
for at least a decade. Each space observatory, 
be it optical, infrared or X-ray, now costs 
more than US$2 billion. The worldwide 
space astronomy budget of about $5 billion 
a year is not enough to sustain a comprehen- 
sive programme. 

NASAs current ‘Great Observatories’ — 
Hubble, Chandra and Spitzer — span the 
infrared to X-ray bands. They will prob- 
ably have just one successor: the James 
Webb Space Telescope (JWST), which 
was designed to study cosmology and gal- 
axy evolution in the distant Universe, and 
so works primarily in the infrared. Over 
the rest of the spectrum, astronomers will 
be blind. A US successor to Chandra is 
unlikely until 2030. By then, the JWST 
will probably be dead, preventing cross- 
fertilization of ideas 
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can have more modest price tags of around 
$200 million. In the next few years, several 
such missions will be launched — one in 
pursuit of higher-energy rays (NuSTAR, 
part of NASA’s Explorer class), one as an 
updated ROSAT (Germany’s eROSITA) and 
one with relatively high spectral resolution 
(Japan's ASTRO-H). But none of these even 
approaches Chandra’s spatial resolution. 
Since 2010, some ideas for achieving 
more powerful missions on a tight budget 
have emerged, including combining the fine 
imaging of Chandra with a light-gathering 
surface 30 times more powerful. This would 
require the development of mirrors that can 
be actively corrected to ensure a sharp image, 
which seems feasible. But such a project 
would cost at least $2 billion. Prospects for 
a set of ‘Greater Observatories’ that span the 
electromagnetic spectrum look bleak. 


A PROFITABLE SOLUTION 

The only viable solution is to lower the cost 
of getting equipment into orbit. Launch 
costs have held steady at some $10,000 per 
kilogram of payload for more than 50 years. 
Spacecraft must have ingenious designs to 
keep their masses low, which makes the cost 
of building and launching high: $100,000 or 
more per kilogram of craft. Overall mission 
costs could plummet if launch costs were to 
decline. 

The best — perhaps only — way to lower 
launch costs is to allow private enterprises 
to profit from space ventures. Profit is the 
counterweight to caution, and competition 
will drive down costs. Several companies 
have already developed operational launch- 
ers for government payloads, including 
Orbital Sciences Corporation of Dulles, Vir- 
ginia, whose Pegasus rocket was, as Nature 
went to press, scheduled to launch NuSTAR 
on 13 June, and SpaceX of Hawthorne, Cali- 
fornia, which last month launched the first 
private mission to resupply the International 
Space Station. 

Ultimately, a healthy market that spurs 
lower costs will require customers other 
than the government, such as those seek- 
ing minable resources more plentiful in 
space than on Earth. One new company, 
Planetary Resources of Bellevue, Wash- 
ington, announced in April its aim to mine 
asteroids. 

Such profit-seeking could decrease launch 
costs by more than an order of magnitude, 
such that fuel costs dominate the overall 
price. Then, and only then, will new gen- 
erations of greater observatories, in X-rays 
and across the electromagnetic spectrum, 
become affordable for scientists. m 


Martin Elvis is at the Harvard-Smithsonian 
Center for Astrophysics, Cambridge, 
Massachusetts 02138, USA. 

e-mail: elvis@cfa.harvard.edu 
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Water levels in the Mekong Basin could rise or fall with climate change — models cannot say which. 


Climate models 
at their limit? 


Estimates of climate-change impacts will get less, 
rather than more, certain. But this should not excuse 
inaction, say Mark Maslin and Patrick Austin. 


science by the Intergovernmental 

Panel on Climate Change (IPCC), due 
to be released next year, climate scientists 
face a serious public-image problem. The 
climate models they are now working with, 
which make use of significant improvements 
in our understanding of complex climate 
processes, are likely to produce wider rather 
than smaller ranges of uncertainty in their 
predictions. To the public and to policy- 
makers, this will look as though the scientific 
understanding of climate change is becom- 
ing less, rather than more, clear. 

Scientists need to decide how to explain 
this effect. Above all, the public and policy- 
makers need to be made to understand that 
climate models may have reached their limit. 
They must stop waiting for further certainty 
or persuasion, and simply act. 

Why do models have a limited capabil- 
ity to predict the future? First of all, they 
are not reality. This is perhaps an obvious 
point, but it is regularly ignored. By their 
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very nature, models cannot capture all the 
factors involved in a natural system, and 
those that they do capture are often incom- 
pletely understood. Science historian Naomi 
Oreskes of the University of California, San 
Diego, and her colleagues have argued con- 
vincingly that this makes climate models 
impossible to truly verify or validate’. 

The more-concrete, less-philosophical 
problems can be illustrated by following 
the path of cascading uncertainties that are 
building up in the models used today. 

One of the first inputs into any climate 
model is the expected accumulation of 
greenhouse gases and aerosols in the 
atmosphere by the end of the century. These 
projections are based on economic mod- 
els that predict global fossil-fuel use over 
100 years given broad assumptions about 
how green the global economy will become. 
The economic collapse of 2008 showed dra- 
matically, and to our cost, how difficult it is 
to predict changes in the economy. And eco- 
nomic unpredictability is just the beginning. 
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Another layer of uncertainty comes from 
how the global climate models are weighted. 
For example, in the most recent IPCC assess- 
ment, released in 2007, the economic sce- 
narios were input into more than 20 general 
circulation models. Every model has its own 
design and parameterizations of key pro- 
cesses, such as how to include the effects of 
clouds; and every model and its output was 
assumed to be equally valid, even though 
some perform better than others in certain 
ways when tested against historic records. 
The differences between the models will be 
exacerbated in the 2013 IPCC assessment, 
because many, but not all, of the models have 
improved spatial resolution. 

The outputs from the circulation models 
are then often used to drive detailed regional 
climate models to predict local environ- 
mental variations. Such regional models 
have huge uncertainties, thanks largely to 
the fact that precipitation is highly variable 
over small scales of time and space. This 
leads to a large range of potential futures, 
some of which contradict others. For exam- 
ple, detailed hydrological modelling of the 
Mekong River Basin using climate model 
input from the UK Met Office’s HadCM3 
model projects changes in annual river dis- 
charge that range from a decrease of 5.4% 
to an increase of 4.5% (ref. 2). Changes in 
predicted monthly discharge are even more 
dramatic, ranging froma fall of 16% to a rise 
of 55%. Advising policymakers becomes 
extremely difficult when models cannot pre- 
dict even whether a river catchment system 
will have more or less water. 

Projected regional changes are then used 
as a basis for ‘impact models’ that estimate 
the effect on the quality of human life. But 
these effects often depend more on the rela- 
tive resilience of a given society than on the 
magnitude of environmental change. Even 
the most advanced socioeconomic models, 
which look at the monetary costs arising both 
in market and non-market sectors, often fail 
to account adequately for major aspects of 
human suffering that are hard to quantify’. 


STRUGGLE WITH EXTREMES 

Akey debate has emerged between scientists 
about how well models can predict extreme 
climates. On the optimistic side, Tim Lenton 
of the University of Exeter, UK, has argued 
that, with more research, models will help 
to provide an early warning system of cli- 
matic tipping points such as the melting of 
the Greenland ice sheet, the dieback of the 
Amazon rainforest and the shift of the West 
African monsoon‘, By contrast, Paul Valdes 
of Bristol University, UK, argues that climate 
models are too stable, built to ‘not fail’ rather 
than to simulate abrupt climate change’. 
When the current IPCC models were tested 
against four major past climate changes, he 
notes, two were unable to even get the basic 
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climate before the shift correct and the other 
two had to be fed parameters up to ten times 
greater than would be realistic to produce 
the abrupt shift. 

The climate models, or ‘climate simula- 
tors’ as some groups are now referring to 
them, being used in the IPCC’s fifth assess- 
ment make fewer assumptions than those 
from the last assessment, and can quantify 
the uncertainty of the complex factors they 
include more accurately. Many of them 
contain interactive carbon cycles, better 
representations of aerosols and atmospheric 
chemistry and a small improvement in spa- 
tial resolution. 

Yet embracing more-complex processes 
means adding in ‘known unknowns, such 
as the rate at which ice falls through clouds, 
or the rate at which different types of land 
cover and the oceans absorb carbon diox- 
ide. Preliminary analyses show that the new 
models produce a larger spread for the pre- 
dicted average rise in global temperature. 
Additional uncertainty may come to light 
as these models continue to be put through 
their paces. Dan Rowlands of the University 
of Oxford, UK, and his colleagues have run 
one complex model through thousands of 
simulations, rather than the handful of runs 
that can usually be managed with available 
computing time. Although their average 
results matched well with IPCC projections, 
more extreme results, including warming of 
up to 4°C by 2050, seemed just as likely*. As 
computing power becomes more accessible, 
that ‘hidden’ uncertainty will become even 
more obvious. 


STABLE AND CONFIDENT 
None of this means that climate models are 
useless. The present models are clearly able 
to reproduce natural climate variability over 
the past 150 years, and have provided an 
essential test of the theoretical link between 
CO, and global temperatures. Their vision 
of the future has in some ways been incred- 
ibly stable. For example, the predicted rise in 
global temperature for a doubling of CO, in 
the atmosphere hasn't changed much in more 
than 20 years (see ‘Prediction stability’). 

This message of stability and confidence 
is often lost on the public: when Andreas 
Schmittner of Oregon State University in 
Corvallis published a value for the climate’s 
sensitivity to a doubling of CO, that was 
on the low side of previous estimates’, the 
media declared that “climate sensitivity was 
overestimated”. This was despite some sig- 
nificant criticisms of the study and the fact 
that the findings still fell within the range of 
those in all the IPCC reports. This is a sad 
indicator of the headlines we might expect 
in the wake of the next IPCC report. 

One approach to tackling the public- 
perception problem is to subtly rephrase the 
conclusions, placing the uncertainty on the 
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PREDICTION STABILITY 


Estimates of climate sensitivity — the rise in global temperature caused by a doubling of atmospheric 
carbon dioxide levels — have remained fairly steady for decades. 
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date by which things will happen, rather than 
onto whether they will happen at all. A recent 
study, for example, showed that the politically 
expedient 2 °C limit will be reached between 
2040 and 2100, depending on our emission 
pathway and the model used’. This ‘wher’ 
not ‘if? approach is powerful. 

In the face of scientific uncertainty, vari- 
ous philosophies for decision-making have 
arisen. But perhaps the best approach is to 
ensure that policies include ‘win-win’ strat- 
egies. Supporting a huge increase in renew- 
able energy would reduce emissions and 
help to provide energy security by reduc- 
ing reliance on imported oil, coal and gas. 

Reduced defor- 


“The biggest estation and refor- 
obstacle is the estation should 
unwillingness draw-down CO, 
of politiciansto —_ from the atmos- 
actinthelong- _ phere and help to 
terminterests of _ retain biodiversity, 
society. » stabilize soils and 

provide livelihoods 


for local people through carbon credits. 
Measures that lessen car use will increase 
walking and cycling, which in turn reduces 
obesity and heart attacks. No one can object 
to creating a better world, even if we turn out 
to be extremely lucky and the scale of climate 
change is at the low end ofall projections. 
The biggest obstacle is the unwillingness 
of politicians to act in the long-term inter- 
ests of society. Politicians use public opin- 
ion and scientific uncertainty as excuses for 
inaction. They used to say “we need to wait 
until scientists prove that mankind is caus- 
ing climate change”. That hurdle has, argu- 
ably, passed, so now they have moved on to 
“we need to wait until scientists can tell us 
exactly what will happen and what the costs 
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are’, or, “we need to wait for public opinion 
to be behind action”. The former will never 
occur, because modelling can never provide 
that level of certainty. The latter is a sleight 
of hand. Politicians often take action without 
public support, from wars to bank bailouts, 
taxation to health-care reforms. 

Greater knowledge and improved models 
will always be desirable, but they are not a 
panacea for political and public reticence 
to action on climate change. Despite the 
uncertainty, the weight of scientific evi- 
dence is enough to tell us what we need to 
know. We need governments to go ahead 
and act, as both the United Kingdom and 
Mexico have done in making national laws 
that contain carbon reduction targets of 
80% and 50%, respectively, by 2050. We do 
not need to demand impossible levels of cer- 
tainty from models to work towards a better, 
safer future. = 


Mark Maslin and Patrick Austin are with 
the Environment Institute and Department 
of Geography, University College London, 
Pearson Building, Gower Street, London, 
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Sanitation for all 


Water pollution from sewage is causing great damage to India. The nation needs to 
complete its waste systems and reinvent toilet technologies, says Sunita Narain. 


anitation is a much-sanitized word. 

It hides the horror of disease and the 

crippling indignity that people have 
to endure when they do not have access to 
a toilet. It also hides the technology divide 
for human excreta, which favours the rich 
in its collection, conveyance and disposal. 
The only solution is a complete sanitation 
system — toilets that are connected to a 
waste-removal and treatment system. But 
conventional technology does not work 
everywhere, or for all. 

Rapidly-modernizing India is drowning 
in its own excreta. According to the World 
Health Organization, more than 600 million 
Indians practise open defecation — making 
up 60% of the 1.1 billion people who do so 
worldwide’. But even as toilets get built, the 
challenge of managing excreta grows. 

More than 87% of people in India’s cities 
(compared with 33% in rural areas) now 
have access to a toilet”. But leaking and 
incomplete sewage systems contaminate 
rivers and lakes, causing diseases such as 
cholera. Around 97 million Indians do not 
have access to clean drinking water, putting 
the nation second only to China’. Similar 
problems afflict other developing countries. 

The challenge for India is to come up with 
ways of dealing with excreta that are afforda- 
ble and sustainable. The first step is to match 
investment in waste-water systems with that 
for water supply. Innovative and affordable 
toilets must be designed and accessible to 
everyone. Only if all waste is treated can 
pollution be controlled. 


THE SEWAGE SPIRAL 
Emerging countries such as India are follow- 
ing a sanitation trajectory, gradually upgrad- 
ing facilities from no toilets to sophisticated 
systems (see ‘India's sanitation divide’). By 
2011, just 20% of urban Indians had no toilet 
or used a public latrine’. But most of those 
with toilets had them connected to septic 
tanks or flowing into open drains. Only the 
top end — one-third of urban households 
and 2% of rural — had toilets that were con- 
nected to underground sewage networks, 
and not all of those reached treatment plants. 
Progress along the trajectory is slow 
because the technology for collecting and 
disposing of excreta was invented in the 
water- and money-rich industrialized world. 
It is capital-intensive, favouring the rich over 
the poor. And it is resource intensive, using 
huge quantities of water. 


City planners worry more about supplying 
water to their citizens than about the waste 
water generated. Yet the effluent inevitably 
goes into streams, lakes and rivers, or seeps 
into the ground to contaminate drinking 
water. Nitrate levels in groundwater across 
India exceed 45 milligrams per litre — a sure 
sign of sewage contamination’. 


INDIA’S SANITATION DIVIDE 


City dwellers are more likely than people in rural 
areas to have flushing toilets, but only one-third 
of those toilets are connected to underground 
sewers; the rest go to septic tanks. 


LUREAN I RUGAL 
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Leaked sewage leads to a deadly and 
costly spiral. As surface or groundwater 
gets contaminated, the city must source 
clean water from farther afield. The cost of 
pumping water rises — it now accounts for 
30-50% of the price of supplying water. The 
cost of building and maintaining pipelines 
increases. And if the network is not main- 
tained, water is lost — 30-50% of the water 
leaks. The nation therefore has less clean 
water to supply and needs to pay more to 
get it to the people. It cannot provide these 
services for everyone, and it chooses the rich. 
As the water system degrades, the rich move 
to bottled water. The poor get sick. 

The over-burdened water utility then 
has no money to invest in sewage facilities. 
Most Indian cities do not have underground 
systems, and those that do, have old pipes 
that are in disrepair. There are few treat- 
ment plants. Officially, the country has the 
capacity to treat 30% of its waste water”. But 
in practice only 20% is processed: not all 
plants function and the pipes leak. The final 
blow comes when the treated waste water of 
the minority gets mixed with the untreated 
sewage of the majority. 

Most believe that the sanitation divide will 
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eventually disappear: one day, everywhere 
will have flush toilets, sewers and treatment 
plants. But in reality, Indian cities are way 
behind: they are growing so fast that their 
infrastructure cannot catch up. This cycle 
must be broken. 


THINK OUTSIDE THE BOX 

Governments must demand change in how 
water and waste are managed. They must 
cut the length of the pipeline, by investing in 
local water supplies such as lakes and ponds 
and using water more efficiently. They must 
invest more in sewage systems, even before 
they invest in water supply. Water accounts 
and tariffs must reflect the full cost of supply 
and of waste collection and treatment. 

The challenge for science is to look 
beyond the modern engineering mindset. 
We know that current technologies, which 
use large amounts of clean water to transport 
small amounts of excreta through expensive 
pipes to costly treatment plants, are unwork- 
able and unaffordable in much of the world. 
Yet toilets and sewage disposal are among 
the least researched technologies. 

New technologies and new thinking are 
urgently needed for use across diverse eco- 
systems. Open drains might become planted 
waterways, with the vegetation cleansing the 
water. Or microbes might be used to decom- 
pose and de-pathogenize effluent. Sewage 
must be treated as a resource — turned into 
water for drinking, irrigation or industry. 

Cities must leapfrog within the sanitation 
trajectory — go straight from no toilets to 
hygienic toilets for all that do not cost the 
Earth. Only then can the challenge be met. m 


Sunita Narain is director general of the 
Centre for Science and Environment in New 
Delhi and author of Excreta Matters: How 
Urban India is Soaking up Water, Polluting 
Rivers and Drowning in its Own Waste 
(State of India’s Environment, Centre for 
Science and Environment, 2012). 

e-mail: sunita@cseindia.org 
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An ‘eco’ toilet such as Peter Morgan’s Blair VIP, which traps disease-carrying flies, can transform lives. 


The bottom line 


If architecture is ‘design for living’, one of its greatest 
challenges is how to live with the masses of waste we 
excrete. Four pioneers in green sanitation design outline 
solutions to a dilemma too often shunted down the pan. 


very year, on average, each of us excretes 
HB: litres of faeces, rife with pathogens 

and heavy metals. Multiplied by Earth's 
population of 7 billion — and rising — that 
constitutes not so much an elephant in the 
room asa herd of mammoths. Sustainable 
solutions are urgently needed, particularly 
for the 2.6 billion people who lack adequate 
sanitation and the 1.1 billion practising open 
defecation. Rich countries, meanwhile, often 
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have hidden sanitation issues of their own. 
There is no single design solution to sani- 
tation. But there are universal principles for 
systematically and safely detoxifying human 
excreta, without contaminating, wasting 
or even using water. Ecological sanitation 
design — which is focused on sustainability 
through reuse and recycling — offers work- 
able solutions that are gaining footholds 
around the world, as Nature explores on the 
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following pages through the work of Peter 
Morgan in Zimbabwe, Ralf Otterpohl and 
his team in Germany, Shunmuga Parama- 
sivan in India, and Ed Harrington and his 
colleagues in California. 

In compost-based ecosanitation, excreta is 
reframed asa resource: fertilizer. Much of the 
research on this has focused on upping nutri- 
ent levels and finding faster, more effective 
ways of removing heavy metals and patho- 
gens such as viruses. Meanwhile, research- 
based, ecological processing of waste water is 
vastly improving water-based systems, bring- 
ing them closer to the ecosanitation ideal. 

Environmental sustainability is only part 
of ecosanitation, however. Defecation is as 
culture-laden as other behaviours, so the 
designs must also be socially sustainable 
— tailored to local customs and strictures, 
whether in Malawi or Manhattan. 

The developed world may think it has 
cracked the problem, but trouble is gurgling 
away underground. ‘Flush and forget’ sani- 
tation systems constitute one of the more 
bizarre hangovers from the Victorian age. In 
older toilets, up to 25 litres of drinking water 
go down the pan per flush, although ‘low- 
flow’ toilet designs are coming into their own 
and, in 1995, the US federal government set 
a 7-litre-per-flush limit. 

Aside from wasted water, the faeces-laden 
‘black water’ from flush toilets is not always 
treated. Many older US and UK sewage sys- 
tems, for instance, mix toilet waste water with 
storm water in so-called combined sewage 
outflows, which can overflow after heavy 
rain. The US Environmental Protection 
Agency estimated in a 2004 report to Con- 
gress that 850 billion gallons of untreated 
water were entering US waterways every year. 

Sewage sludge — the semi-solid mush left 
after wastewater treatment in sewage works 
— can be as problematic. Although it can 
contain significant traces of pharmaceuticals 
and heavy metals even after treatment, it is 
widely used in the West as a soil conditioner 
and fertilizer on cropland, with uncertain 
effects on human health. 

In the packed cities and scattered villages 
of the developing world, the challenge is 
even more daunting. Thousands of children 
die every day from a lack of basic sanitation 
or clean water. Open defecation contami- 
nates soils with the eggs and larvae of soil- 
borne intestinal worms, or helminths, as well 
as other pathogens. More than one billion 
people are infected with these helminths, 
which cause, among other problems, weak- 
ness and malnutrition. 

So a toilet can be transformative. A clean 
environment means better health — and that, 
in turn, is a springboard to development. As 
governments debate the finer points of global 
development challenges at Rio+20 next week, 
they might find it worth asking why sanitation 
falls to the bottom of most policy agendas. = 
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PETER MORGAN 
Inspired by ant 
turrets and the 
flight paths of flies 


Environmental scientist and designer 
of the Blair VIP toilet, Harare 


“The designer knows he has reached perfec- 
tion, not when there is no longer anything to 
add, but when there is no longer anything to 
take away.’ I recalled this anonymous quote 
when, as a young biologist in Rhodesia (now 
Zimbabwe) in the 1970s, I was working for 
the health ministry's Blair Research Labo- 
ratory — named after former health secre- 
tary Dyson Blair. Blair had persuaded me to 
change fields, from schistosomiasis control 
to technological solutions for public health, 
and tasked me with designing new toilet 
systems for use in rural areas. 

Open defecation was then common, 
and the existing pit toilets bred blowflies of 
the genus Chrysomyia, as well as other fly 
species that carry enteric disease. In a survey 
conducted between March 1974 and April 
1975, we counted more than 20,000 flies 
emerging from a single pit toilet. 

It had long seemed to me that simplicity 
could be related to elegance of design. My 
first innovation, called the Blair toilet after 
Dyson (and later dubbed the Blair ventilated 
improved pit, or BVIP toilet), is simple: a 
pit, lined with bricks for stability; a con- 
crete sanitary slab with one hole for squat- 
ting, and another fitted with a vent pipe 
stretching from the slab to above roof level; 
and a spiral superstructure that obviates 
the need for a door but guarantees semi- 
darkness (see ‘Air traffic control’). 

The design harnesses natural prin- 
ciples. The turrets of ant nests — the 
most elegant of nature’s chimneys — 
inspired the vent. The natural behaviour 
of flies, which are attracted by odour and 
light, determined the other design features. 
When air passes over the top of the vent, 
suction draws more air through the 
squatting hole into the pit, then sends 
the odours up through the vent. Some 
flies are drawn to those odours; oth- 
ers, entering the pit through the squat 
hole, are drawn to the light from the 
bottom of the vent. Either way, the flies 
are trapped and die, because the vent 
is fitted with a non-corrodable screen, 
usually made of aluminium. 

The concept is simple and it works. 
From October to December 1975, weekly 
counts of fly output were taken from two 
Blair toilets and two unventilated pit toi- 
lets: a total of 13,953 flies were trapped 


from the unventilated toilets and only 
146 from the ventilated toilets (P R. Mor- 
gan Cent. Afr. J. Med. 23, 1-4; 1977). A fam- 
ily BVIP will last 10-15 years, and once it is 
abandoned, the superstructure materials can 
be recycled. The excreta gradually dries, and 
can be used as compost. 

The BVIP is now the backbone of 
Zimbabwe's sanitation programme, with 
halfa million family toilets built so far, and 
is widespread in other African countries. A 
multicompartment unit was designed for 
schools. More recently, Ihave drawn up a 
cheaper, upgradeable family unit, which can 
be built in stages — allowing them to ascend 
the ‘sanitation ladder’ 

I devised other toilets to speed up the 
composting process. The Arborloo is an 
unlined pit 1 metre deep, which is fitted with 
a circular brick or concrete rim and a sani- 
tary slab. As it fills, soil, ash and leaves are 
added to accelerate composting and control 
flies and odour. After a year, the toilet super- 
structure is moved to a new pit, anda tree is 
planted in a layer of soil on top of the old pit, 
to provide shade, fuel or fruit. Thousands of 


AIR TRAFFIC CONTROL 


The Blair toilet uses a vent to trap flies, 
preventing them from spreading disease. 
The vent also draws out odours from the pit, 
making the toilet interior almost odour free. 


Flies outside are attracted to the 
the odour from the vent pipe 
igh 
y Wind flow 


A fly screen prevents flies from 
entering and leaving the pit 


. 4 Flies from pit are 
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light of the vent 
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these have been built in Malawi and about 
70,000 in Ethiopia. 

The Fossa Alterna is another variation: 
two shallow pits are dug and used alternately, 
swapping at annual intervals. By the time 
one pit has filled, the compost in the other 
will be mature and ready for use. 

Perhaps working in an area for which 
I have not received formal training has 
given me freedom of expression in observ- 
ing, exploring and researching. I had to 
use instinct and plain logic. And it pays to 
adapt natural principles honed over millions 
of years. 


RALF OTTERPOHL 
Boosting compost 
with biochar and 
bacteria 


Environmental engineer and director, 
Institute of Wastewater Management 
and Water Protection, Hamburg 
University of Technology, Germany 


With my team in Hamburg, I have studied 
resource-oriented sanitation — in which 
waste is seen as reusable — for 15 years. I 
was drawn towards this path having started 
out mathematically modelling mass flows at 
large-scale wastewater treatment plants — a 
process I found frustratingly inefficient. 

In 2010, I began to focus on a practice 
originating in Brazilian Amazonia more 
than 1,000 years ago that could, paradoxi- 
cally, kick-start a modern revolution in 

composting sanitation. The pre-Colum- 
bian Indians created ‘black’ soils known 

in Portuguese as terra preta. Found in 
patches throughout the Amazon, they 
are composed of charcoal (biochar), 
composted excreta and other bio-waste. 
They are absorptive and high in nitrogen, 
phosphorus, potassium and calcium. At 
Hamburg, we are adapting this mixture for 
use in ecosanitation systems. 
Systems using terra preta technology 
can help to solve two problems that 
plague many developing countries: 
poor soils and a lack of sanitation. 
The technology offers the efficient 
creation of well-structured, humus- 
rich compost, which is important for 
food security, resistance to soil ero- 
sion, water retention in soil and the 
growth of local agricultural econo- 
mies. And it is cost-effective. A basic 
terra preta sanitation toilet costs about 
US$50, inputs are cheap, and it is not 
hooked up to sewage systems. 
Terra preta sanitation is a three-step 
process. First, lactic-acid bacteria are 
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added after each defecation: the anaerobic 
fermentation sanitizes and deodorizes. We 
have used cultures ranging from sauerkraut 
liquor to strains scientifically selected not 
to produce gas, such as Lactobacillus plan- 
tarum, Lactobacillus casei and Pediococcus 
acidilactici. At the same time, a waste-sugar 
source such as molasses or vegetable scraps 
is added as bacterial feed. This process con- 
tinues while the excreta collect and for at 
least another week after collection ceases. 

Second, about 50 grams of powdered 
charcoal — preferably ‘clean, from wood-gas 
stoves — is added with each bacterial appli- 
cation to absorb odour and bind nutrients. 
This biochar also creates microporous space 
for the lactic-acid bacteria to inoculate the 
faeces and lower their pH value. 

Finally, earthworms (vermiculture) com- 
post the collected and treated material aero- 
bically, further sanitizing the mixture over a 
period of three months to a year. The result- 
ing compost is safe for use with non-food 
‘industrial’ crops or forest trees. Two years 
of further processing creates compost that 
can be used on food crops. 

Terra preta sanitation systems can be 
modified to suit rural or urban sites, but 
it must be possible to seal collection and 
transport receptacles for the fermentation 
to work. Rural versions can be simple bucket 
toilets fitted with a urine diversion, so that 
faeces drop into one bucket and urine is 
piped into another. Urine can be lacto-fer- 
mented separately with the bacterial mix for 
a month or more to avoid nitrogen losses 
and odours, then used as fertilizer, pref- 
erably on non-food crops. Alternatively, 
the urine can be used immediately after 
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There is a social side to sustainable sanitation: safe and attractive toilet blocks are being b 


collection, mixed with at least five parts 
water. 

For urban settings we are designing a 
new family toilet flushed with very small 
amounts of water, sprayed either manually 
with a bottle or through a nozzle attached to 
sink pipes. This toilet mixes faeces and urine. 
Because moisture impedes composting and 
the high nitrogen content must be compen- 
sated for with a carbon source, half a kilo- 
gram of woody waste must be added later at 
the composting site. This unit has a tank big 
enough for at least a week’s worth of waste. 
The treated waste can then be collected and 
made into terra preta at a professional com- 
munal composting site. 

Our research shows that these toilet sys- 
tems and variations on the basic terra preta 
processes could work even in densely popu- 
lated cities — a revolution indeed. 


SHUNMUGA PARAMASIVAN 
Creating social 
inclusivity for 
girls and women 


Country director, Wherever the Need, 
Pondicherry, India 


Since 2007, I’ve helped to design and install 
toilets across India that tackle specific 
environmental and social challenges — 
including those relating to women, schools 
and slums. All are environmentally sus- 
tainable urine-diversion dry (composting) 
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uilt for girls at schools in India, raising attendance levels. 


toilets, which use little water and counter 
India’s serious faecal pollution problem (see 
page 185). 

The challenges we are up against are con- 
siderable. More than 626 million Indians 
practise open defecation daily, polluting 
groundwater and creating ideal breeding 
conditions for disease. Meanwhile, urban 
India’s pace of growth is incredible — by 
2030, estimates suggest that some 575 mil- 
lion people will be living in cities. One in 
four of the country’s city dwellers (93 mil- 
lion) lives in slums. 

But the social side of sustainability is 
as important as the environmental. Inad- 
equate or no sanitation facilities in schools 
mean that once girls reach puberty they 
stay at home when they are menstruating, 
disrupting their studies. And women tend 
to excrete when it is dark because they are 
seeking privacy — but that makes them 
vulnerable to attack, or even rape. So our 
designs factor in safety and ease of use for 
girls and women. 

The girls’ school sanitary blocks we have 
designed in Cuddalore, Tamil Nadu, have an 
exclusive wash room with a washing facility, 
napkin vending machine and an incinerator. 
They are painted in brilliant colours, fully 
enclosed and imaginatively laid out so that 
they are enjoyable to use. We educate the 
girls with a software programme underlin- 
ing the need for toilets, personal hygiene, 
and maintenance and management of the 
facility. Pupils play their part by helping to 
clean the toilets. 

School sanitation is a large part of our 
focus, especially in rural villages. Although 
the older generation uses entrenched 


methods of toileting, it is easier to convince 
children to use new ways. They then become 
agents of change for their families and 
communities. 

In 2008, we installed 150 toilets in the 
slums of Cuddalore, which presented dif- 
ferent issues. Family toilets with dedicated 
compartments to serve women and chil- 
dren are essential, but space was so limited 
we opted for mobile toilets. Two hundred 
families used the first unit; the second, rede- 
signed to resemble a Portaloo, serves 280 
families. We developed new single mobile 
ecosanitation units serving 15 to 20 fami- 
lies when the community asked for them. 
Mobile urinals were deployed in the most 
congested places, such as the hospital and 
harbour. 

Both human compost and urine as a fer- 
tilizer are in high demand from farmers 
and Wherever the Need is now storing and 
treating both for eventual sale. The proceeds 
are paying for the logistics involved in this 
operation. Next we hope to replicate the 
operation in Chennai. 


Building a coastal 
wetland in the 
heart of a city 


General manager, San Francisco Public 
Utilities Commission, California, and 
former chair of the Water Utility Climate 
Alliance. 


From the earliest planning stages, in 2009, 
for the new 13-storey headquarters of the 
San Francisco Public Utilities Commission, 
I knew the building would need to demon- 
strate the commission's ambitious sustain- 
ability goals — we needed to ‘walk the talk. 

Water reuse was a central concern, but 
we quickly discovered that many decen- 
tralized water-reuse technologies were too 
energy-intensive for our building’s energy 
budget. My team proposed that we pursue 
ecological sanitation methods inspired by 
wetlands, and challenge ourselves to defy 
conventional wisdom that the space such 
methods demand is too big for dense urban 
areas. 

In our low-energy solution, wastewater 
treatment, which is usually buried in the 
basement, is visible — in the atrium and 
from the pavements where thousands walk 
every day. And we've done it in one of the 
highest-density neighbourhoods in one 
of the highest-density cities in the United 
States. 

The building, which is expected to receive 
a leadership in energy and environmental 
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Tidal-flow wetlands 


> pis) Recirculation tank 


| 


Primary tank 


Lt Black water from toilets and grey water from sinks and showers 


is directed to the primary tank for filtration. 


Water is pumped to tidal-flow wetland cells, which are repeatedly filled and drained to create 
multiple tidal cycles each day, speeding up natural water-treatment processes. 


| Vertical-flow wetland cells provide a final level of treatment, removing 
aE ey remaining organic material and excess nitrogen. 


Mans treated water is disinfected with ultraviolet light and trace amounts of chlorine, 


and is ready for reuse in toilet flushing. 


design (LEED) platinum certification from 
the US Green Building Council in Washing- 
ton DC, contains a sustainable, integrated 
water-management system that will serve 
more than 1,000 staff and visitors daily. 
The Living Machine system treats all waste 
water in the building for reuse in flushing 
toilets (see “Water management wetlands 
style’). Rainwater is harvested from the 
roof and the child-care centre’s play area 
for landscape irrigation. By treating and 
reusing all waste water on site in a self-con- 
tained system, we avoid discharging usable 
water. 

Our system reduces total water use by 
about 65%, saving 3 million litres of drink- 
ing water per year. We anticipate eventually 
producing a surplus of 2.6 million litres of 
treated water for non-potable uses such as 
toilet flushing or park irrigation nearby. 

The process begins and ends with low- 
flow toilets. After flushing, black water from 
the toilets is combined with grey water, from 
the sinks and showers, in plumbing pipes. 
The mixed waste water is then directed to 
tanks beneath the pavement for filtration; 
filtered solids such as faeces are pumped 
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periodically to the municipal sewage sys- 
tem. From the tanks, the water is pumped 
into tidal-flow wetland ‘cells, set next to the 
pavement and in the lobby, and disinfected 
using ultraviolet light and trace amounts of 
chlorine. 

The cells are filled with gravel, bacteria 
and flowering grasses and plants — includ- 
ing Japanese sweet flag (Acorus gramineus) 
and western bleeding heart (Dicentra for- 
mosa). The plants were chosen for their root 
structure, which supports the establish- 
ment of treatment bacteria, and for resil- 
iency in San Francisco's climate and urban 
environment. 

The cells mimic the ecology and natu- 
ral water-treatment processes of coastal 
wetlands, and are alternately flooded and 
drained to create multiple tidal cycles per 
day, speeding up the natural water-treatment 
processes by increasing the influx of oxygen. 
The system is responsible for recycling some 
19,000 litres of water per weekday, but all 
people see is lush, vibrant plantings. It is a 
blend of function and aesthetics, as well as 
a demonstration of the ecological sanitation 


processes at work. = 
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The underground sewage system that residents installed in Orangi has vastly improved public health. 


Q&A Arif Hasan 


Architect of change 


Urban campaigner and architect Arif Hasan has been central to a sanitary revolution, 
transforming Orangi, Karachi, from informal settlement to thriving community. Using his 
technical know-how, residents built a sewage system, sparking vast social change. Now chair of 
Pakistan's urbanization task force, he discusses incorporating sustainable design into poor cities. 


What challenges did 
Orangi face when you 
arrived? 

In 1980, the lanes of 
Orangi, a katchi abadi 
or informal settlement 
housing a million 
people, were running 
with waste water and 
sewage, and infant mortality was 128 in 
1,000. The conditions stymied development: 
school attendance was down and trade dif- 
ficult to establish. The psychological effects, 
too, were severe, sapping the will for change. 
The lanes couldn't be used as public space 
and quarrels over sanitation issues were fre- 
quent. The wastewater also damaged house 
foundations and triggered unhealthy rising 
damp. 


How did you become involved? 

That year, the social scientist Akhtar 
Hameed Khan set up the Orangi Pilot Pro- 
ject [OPP] to understand local problems and 
develop models to overcome the constraints 
governments face in upgrading informal 
settlements. Khan encouraged people to 
build and pay for their own underground 
sewage system, at a cost of around US$30 a 
household. A year later, he needed an archi- 
tect. That was me. 
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How did you help? 

I proposed cheap, simple, local solutions: 
concrete-curing methods, casting cheap 
steel-shuttering manhole covers in situ, 
site-appropriate tools such as manual com- 
pactors, and surveys and maps. I also devel- 
oped rules of thumb for gradients, manhole 
and pipe sizes, survey procedures and 
inexpensive one-chamber septic tanks. 
These designs and methods, which chal- 
lenged conventional engineering standards, 
have stood the test of time. They also 
reduced costs by more than 40%. 


How did sanitation transform public health? 
By 2000, some 85% of Orangi had self-laid, 
self-financed sewer lines. The lanes are 
clean: children play in them, women sit and 
talk. Health indicators have improved, and 
by 1993 infant mortality had fallen to 37 in 
1,000. 


What are the social and economic changes? 
Literacy rates there are now among the 
highest in Pakistan. Socially and economi- 
cally, Orangi is much more connected to 
the rest of Karachi in diverse ways. Many 
people are white-collar workers; a substan- 
tial minority are professionals; women and 
entrepreneurs work in and service formal- 
sector industry. 
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Is Orangi a model for other cities? 

As an outgrowth of the process of building 
sanitation communally, residents became 
community activists and got involved in OPP- 
supported health, education, housing and 
micro-credit programmes. The architects and 
technicians who helped to initiate the project 
nowrun it, and have expanded it to cities and 
settlements all over Pakistan, often in collabo- 
ration with local governments and planning 
agencies. The OPP’s philosophy has had a 
major impact on the attitudes of professionals, 
academic institutions, government officials 
and international non-governmental organi- 
zations and agencies working in Pakistan. 


What other urban problems are you tackling? 
In 1992, a square metre of land on the periph- 
ery of Karachi cost 1.7 times the daily wage. 
Today, itis 40 times. Many cannot afford that, 
so housing is getting denser. In the inner city, 
the density is more than 4,000 people per hec- 
tare [compared with less than 150 in London]. 
Although water supply and electricity provi- 
sion have increased hugely in Karachi over the 
past 20 years, the sharing of toilets and kitch- 
ens has risen, along with the number of peo- 
ple per room. You can't live a comfortable life 
at such densities unless they are planned for. 


What are your solutions? 

With colleagues, I have developed design 
guidelines and incremental development pro- 
cesses to accommodate high densities in an 
environmentally friendly, affordable way. We 
hypothetically redesigned four different Kara- 
chi communities with computer modelling. 
Three redesigns showed that densities even 
higher than those prescribed by the Karachi 
Building Control Authority were workable. 


What other challenges does Karachi face? 

Many natural drainage channels and water 
bodies have been filled in to make way for 
elite coastal housing and new informal settle- 
ments upstream. So all of Karachi now floods. 
But other environmental issues are arising 
because of the new paradigm of Karachi as 
a ‘world-class city, fuelled by direct foreign 
investment. New infrastructure caters to the 
corporate sector, and its architectural style 
imitates the West. There is anew dependence 
on air conditioning despite local expertise in 
creating buildings comfortable without it. 


How has architecture in Karachi changed? 
When I began my practice in 1968, an archi- 
tect designed for the state, or for the rich. 
Today, clients vary hugely. This new archi- 
tectural world belongs to women, who make 
up 92% of the architecture and planning stu- 
dents at the University of Karachi. In many 
ways, the role of the architect is immense 
compared to before. m 
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Don’t forget health 
in sustainability talks 


Many wealthy nations are 
confronting the health and 
welfare implications of high 
unemployment, political 
instability and eroding social 
security. Such crises are part of 
our failure to move sustainability 
beyond rhetoric — a fact not 
adequately recognized by the 
Group of Eight (G8) community. 
The Rio+20 meeting in Brazil 
next week must accept that 
sustainable development and the 
health of populations depend on 
each other. 

Peak global health may 
already be here. Low- and 
middle-income populations 
endure a double burden 
from communicable and 
non-communicable diseases, 
while health inequity and 
undernutrition persist. Social 
and ecological factors such as 
climate change, energy and food 
insecurity, counterfeit drugs, 
antimicrobial and insecticide 
resistance and poverty all 
undermine health in our 
interconnected world. 

Many development specialists 
suggest that health has had 
its opportunity through the 
United Nations’ Millennium 
Development Goals, and that the 
strategic imperative must shift to 
agriculture or energy. But these 
are only two legs of the stool: 
health remains the third. 

When sustainability becomes 
areality, it will reduce many risks 
to health. For example, clean 
energy will eliminate the need 
to clear forests and use crops 
for biofuel, so people will be 
better fed. Improved agricultural 
practices will reduce exposure to 
the animal-to-human infections 
that are associated with land 
clearing. A green revolution 
will protect biodiversity by 
producing more food on land 
that is already cleared. Better 
economic, educational and social 
opportunities in rural areas will 
slow migration and alleviate the 
pressure on overloaded urban 
health resources. 


Bianca Brijnath Monash 
University, Australia. 

bianca. brijnath@monash.edu 
Anthony J. McMichael, Colin 
D. Butler National Centre for 
Epidemiology and Population 
Health, Australian National 
University, Australia. 


US policy should not 
shape collaborations 


The aims of the Global 
Research Council (GRC) 

to improve international 
scientific collaboration (Nature 
485, 427; 2012) have been 
enthusiastically endorsed by 
John Holdren, director of the 
White House Office of Science 
and Technology Policy, and 
Subra Suresh, head of the US 
National Science Foundation 
(see go.nature.com/rbiykb). We 
disagree with their statement 
that “good science anywhere 

in the world is good for science 
— and good for people — 
everywhere in the world”. 

The agreed GRC standards 
for merit-based peer review will 
improve research quality through 
collaborations between high- and 
middle-income countries, which 
have strong research systems. 
But the guidelines are unlikely 
to boost science capacity in low- 
income countries. 

Holdren and Suresh point 
out that US researchers could 
lose global funding if other 
governments do not review US 
proposals on their merit, and 
that the country’s economic 
interests could be harmed 
if colleagues do not respect 
confidentiality and intellectual 
property. Subsequently, the 
heads of publicly funded 
science agencies from 47 
countries agreed on the GRC’s 
Statement of Principles for 
Scientific Merit Review. 

It therefore seems that the 
GRC standards are transposing 
concern about intellectual 
property to the arena of peer 
review. This is reminiscent 
of the linking of intellectual- 
property rights with world trade 


law, which has created barriers 
to drug access in low-income 
countries. 

Instead of implementing an 
aspect of US foreign policy that is 
intended to protect US economic 
interests, the GRC should 
promote scientific research 
in low-income countries. 

Apart from implying that they 
will discuss the expansion of 
open-access publishing in these 
countries, GRC members seem 
to have paid little attention to 
this issue. 

Bridget Pratt, Bebe Loff 
Monash University, Australia. 
bridget.pratt@monash.edu 


Planetary boundaries 
concept is valuable 


As researchers studying the 
concept of setting environmental 
boundaries for planetary 
well-being (J. Rockstrém et al. 
Ecol. Soc. 14, 32; 2009), we 
disagree with Simon Lewis's 
suggestion that there are flaws 

in these proposals (Nature 485, 
417; 2012). 

Lewis contends that the 
concept’s focus is too narrow, that 
it doesnt distinguish between 
‘boundaries and ‘thresholds’ and 
that it should clarify the influence 
of scale on different problems. 
These are misunderstandings. 

Planetary boundaries are not 
fixed ‘supply limits, but are set 
within a safety margin around 
complex thresholds that are 
intertwined at regional and 
global scales. Ecosystem changes 
caused by nitrogen pollution, 
for example, are driven by 
global trade and cannot be 
uncoupled from climate change 
and alterations in land use. Also, 
investment in new phosphorus 
technologies can address the 
problems of both pollution and 
stock control. 

It is a mistake to see the setting 
of environmental boundaries as a 
call for multilateral negotiations 
around static limits. They 
are instead a bid to reform 
environmental governance 
at multiple scales. Scientific 


assessment, institutional 
interaction and social—ecological 
innovation will enable us to 
work more effectively towards 
planetary stewardship. 

Victor Galaz* Stockholm 
Resilience Centre, Stockholm 
University, Sweden. victor. galaz@ 
stockholmresilience.su.se 

*On behalf of 4 co-authors (for a 
fulllist, see go.nature.com/uct6dd). 


Combat the effects of 
Forest Code changes 


On 25 May, Brazil’s President 
Dilma Rousseff endorsed 
revisions to the country’s Forest 
Code. Thanks partly to an 
appeal from the international 
scientific community and non- 
governmental organizations, 
together with a petition of 
more than 2 million signatures, 
Rousseff also vetoed some of the 
code’s provisions. But more is 
needed. 

The revised code will continue 
to protect forests in crucial 
locations, such as along rivers, on 
hilltops and in coastal wetlands, 
as well as a specified percentage 
of those on private property (see 
go.nature.com/fzxmj5). There 
will be no amnesty for offenders 
who illegally logged forests in 
the past. 

I believe, however, that policies 
to reduce deforestation and foster 
sustainable development need 
to be strengthened to counter 
any potentially adverse effects of 
the new Forest Code. Payments 
for ecosystem services must 
be increased — for example, 
to expand the important work 
of the Forest Conservation 
Allowance Programme (Bolsa 
Floresta) in the Amazon. 
Ranching, agriculture and other 
economic activities should be 
confined to existing deforested 
areas. Environmental laws need 
stricter enforcement, and regular 
monitoring of forest areas is a 
priority. 

Alison G. Nazareno Federal 
University of Santa Catarina, 
Florianopolis, Brazil. 
alison_nazareno@yahoo.com.br 
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Learning about who we are 


Microbial inhabitants outnumber our body’s own cells by about ten to one. These residents have become the subject of 
intensive research, which is beginning to elucidate their roles in health and disease. SEE ARTICLES P.207 & P.215 


DAVID A. RELMAN 


he dawn of the twenty-first century has 

seen the emergence of a major theme in 

biomedical research: the molecular and 
genetic basis of what it is to be human. Sur- 
prisingly, it turns out that we owe much of our 
biology and our individuality to the microbes 
that live on and in our bodies — a realization 
that promises to radically alter the princi- 
ples and practice of medicine, public health 
and basic science. It is therefore appropriate 
that ever more research is focused on these 
microbes and their genes, which together 
are known as the human microbiome’. In 
this issue, the Human Microbiome Project 
Consortium”’ publishes the most exten- 
sive catalogue yet of organisms and genes 
pertaining to our microbiomes. 

The first observations of indigenous human 
microbiota were published more than 300 years 
ago, soon after the invention of the micro- 
scope. Today's view of the microbial world has 
been radically improved by DNA-sequencing 
technology. In the wake of the Human Genome 
Project, calls were issued’ for enhanced efforts 
to be made to characterize the ‘second human 
genome’ — the human microbiome. At the 
end of 2007, the US National Institutes of 
Health (NIH) launched the Human Micro- 
biome Project (HMP) and, in early 2008, the 
European Commission and China initiated 
the Metagenomics of the Human Intestinal 
Tract (MetaHIT) project. Other countries 
have begun similar ventures, motivated in 
part by an interest in better defining their 
biological heritage. 

Two studies, by Huttenhower et al.? 
(page 207) and Methé et al.’ (page 215), 
together with 15 other papers”® that are 
being published simultaneously elsewhere, 
comprise the first reports of the HMP Con- 
sortium research groups. The primary data, 
as described by Methé and colleagues’, 
were derived from samples collected from 
242 healthy adults in the United States, at 15 
(for males) or 18 (for females) body sites — 
from the skin, nose, mouth, throat, vagina and 
faeces (to represent the distal gastrointestinal 
tract). Each person was sampled up to three 
times over 22 months, generating a total of 
11,174 samples. 


194 | NATURE | VOL 486 | 14 JUNE 2012 


Anterior nares 

900 Buccal 

30,000 -——__| mucosa 
800 

Supragingival 70,000 

plaque 

1,300 

20,000 

Faeces 

(distal gut) 

4,000 -——— Posterior 

800,000 fornix 
300 
10,000 


Figure 1 | Variation in diversity. Researchers of 
the Human Microbiome Project are studying the 
microbial inhabitants of the human body, using 
samples taken from 242 healthy adults at 15 (for 
males) or 18 (for females) body sites — from the 
skin (four sites), mouth and throat (nine sites), 
vagina (three sites), nostrils and faeces (to represent 
the distal gastrointestinal tract). Huttenhower et al.’ 
and Methé et al.” have estimated the number of 
microbial species and their genes in these samples, 
and found substantial variation in microbial 
community composition at different body 

habitats. The two groups used different counting 
methodologies, and their numbers vary accordingly, 
such that exact figures are not available. However, 
crude estimations’ of number of microbial species 
(red) and number of microbial genes (blue) are 
shown for examples of: sites containing high species 
diversity, such as the gastrointestinal tract and teeth 
(supragingival plaque); sites with intermediate 
diversity, such as the inside of the cheek (buccal 
mucosa) and nostrils (anterior nares); and sites with 
lower diversity, such as the vaginal posterior fornix. 
The authors also found substantial variation in both 
the diversity and the composition of the microbial 
communities at different sites within the same 
general body region. 


The consortium researchers obtained the 
nucleotide sequence of the small-subunit ribo- 
somal RNA — a molecule found in all cellular 
life — from microorganisms in 5,177 of these 
samples*. These sequences are commonly 


© 2012 Macmillan Publishers Limited. All rights reserved 


used to infer the genetic relationships between 
organisms. The researchers also surveyed the 
genomes of the microbes in 681 of the samples” 
using a shotgun sequencing approach, which 
generates random sequences (reads) from a 
complex pool of DNA molecules. The reads 
are then assembled on the basis of overlap- 
ping sequence similarity, allowing researchers 
to identify genes and to predict the functions 
of the proteins that they encode. 

The investigators mapped their reads to 
all available microbial and viral genome 
sequences to assess community composition 
— the different types of microbes and their 
relative abundance — at the various body sites. 
The researchers also determined the whole- 
genome sequences of about 800 bacterial 
strains isolated from humans (from a planned 
total of 3,000); these sequences have been 
placed in public databases and can be used as 
reference genomes for comparative purposes. 
The consortium authors conclude” that they 
have identified the majority of the common 
microbial taxa and their genes present in these 
242 healthy humans. 

One of the great strengths of the HMP is 
that samples were collected simultaneously 
from multiple body habitats of the same indi- 
viduals. This allowed Huttenhower et al.’ to 
discover that taxonomic and genetic diver- 
sity were greatest in tooth and stool samples, 
intermediate in skin samples and on the inside 
surface of the cheek, and lowest in vaginal sam- 
ples**” (Fig. 1). The researchers report that 
each habitat is characterized by a small num- 
ber of highly abundant ‘signature’ taxa, but that 
the relative representation of taxa and genes 
in each habitat varies considerably between 
individuals. In most samples, high-abundance 
taxa are accompanied by low-abundance taxa 
from the same genus, suggesting that within- 
community niche specialization occurs. 
These findings confirm those of an earlier 
study*, which demonstrated that body habitat 
accounts for much of the variation in bacte- 
rial community composition. Although there 
is clear evidence for individuality in people’s 
microbiome compositions, the limited tem- 
poral scope of the HMP data set prevents a 
robust analysis of how these communities 
change over time. 

As shown previously for faecal samples’, 


the relative abundance of microbial genes 
associated with certain physiological path- 
ways varied less between samples from the 
same habitat than did the relative abundance 
of taxa. This suggests that there is functional 
redundancy between microbial community 
members. The prevalence of low-abundance 
genes varied the most between habitats, and 
Huttenhower and colleagues speculate that the 
functions of these genes correspond to body- 
niche-specific activities. Interestingly, the 
researchers also found evidence of taxonomic 
co-variation across sites in an individual, 
such as between the communities of the skin 
and the saliva. One possible explanation 
for this is that interactions between com- 
munity members are subject to selection 
pressure by host-specific, and host-wide, 
environmental factors. 

Studying the human microbiome has so 
far been a lesson in humility. Although the 
HMP and the MetaHIT project" are revealing 
vast amounts of previously uncharacterized 
microbial diversity within our ‘home turf; the 
functions of these communities remain largely 
unknown. Moreover, Methé et al. report that 
only 57% of the non-redundant gene families 
identified by the HMP and MetaHIT research- 
ers were detected by both groups, with 34% 
of the gene families being detected only in 
the HMP data and 10% only by the MetaHIT 
project. This raises questions about the 
representativeness of the people in the two pro- 
jects. In both cases, samples were taken from 
adults in developed nations who have relatively 
similar lifestyles and, in the case of the HMP, 
without inflammatory disease. Another recent 
report reveals" that populations living in less- 
developed regions of the world have markedly 
different microbiomes from those living in the 
United States. 

It is also important to consider the defini- 
tion of health. The most common cause for 
exclusion of people from the HMP was chronic 
gum disease, a condition that is increasingly 
regarded as ‘normal’ in developed countries. 
Furthermore, the prevalence of overweight 
and obese individuals continues to rise in many 
populations around the world, the chronic use 
of prescription drugs is becoming more com- 
mon, urbanization is increasing, and our natu- 
ral environment is changing in unexpected 
ways. Future studies of the human microbiome 
should accommodate such factors, which are 
likely to influence our microbial inhabitants. 

Many areas of human-microbiome research 
warrant further investigation, but viruses and 
small non-bacterial organisms such as fungi 
deserve special attention, as do questions 
regarding the functions of the microbiome. We 
are essentially blind to many of the services that 
our microbial ecosystems provide — and on 
which our health depends’* — and investi- 
gators desperately need new approaches for 
studying interactions between members of the 
microbial community and their human hosts. 


As Huttenhower and colleagues suggest”, the 
fact that some of the microbiome’s functions 
are likely to be performed by rare community 
members or to involve genes that are expressed 
at low levels will further complicate attempts to 
decipher their influence. 

Despite the valuable initial findings from 
the HMP and other projects, multiple lines of 
enquiry remain. Which factors are responsible 
for the day-to-day or longer-term variation in 
the composition and functions of a person’s 
microbiome? To what degree are such fac- 
tors intrinsic to the microorganisms, related 
to the host, or, indeed, stochastic? Which 
mechanisms regulate bacterial colonization or 
invasion of the human microbiome, how does 
the microbiome respond to disturbance, and 
to what degree does this response involve the 
propagation of surviving organisms versus new 
colonization from outside? What is the basis of 
resilience in the human microbiome, and can it 
be predicted and restored? Such questions sug- 
gest that the work of Methé et al., Huttenhower 
et al. and the numerous others studying the 
human microbiome is only just beginning. m 
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Majorana modes 


materialize 


Elusive theoretical fantasies known as Majorana modes have been observed ina 
hybrid semiconductor-superconductor system. These emergent exotica open up 
promising prospects for quantum computation. 


FRANK WILCZEK 


he condensed-matter physics com- 

munity has been galvanized by the 

apparent experimental discovery of 
Majorana modes, reported by Mourik et al. 
ina paper published in Science. These entities, 
whose existence had been predicted’ theo- 
retically, could become major components in 
quantum engineering. Specifically, they might 
provide the basic elements — the qubits — for 
a quantum computer*“, 

Over the past few decades, physicists have 
discovered that particle-like excitations called 
quasiparticles, found in condensed-matter sys- 
tems, can have strange, fascinating and pos- 
sibly useful properties, including fractional 
electric charge and unconventional quantum 
statistics. Majorana modes are a major addi- 
tion to this universe of exotica. 

To set the context and avoid misunderstand- 
ing, a brief reflection on the fundamental 
nature of quasiparticles is in order. The ele- 
mentary building blocks of material systems 
are not negotiable; electrons, photons and 
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atomic nuclei are what we get to work with. 
Furthermore, the basic interactions among 
those ingredients are known to high accuracy: 
Maxwell's electrodynamic and Schrédinger’s 
quantum equations rule. So how, within that 
familiar and reliable framework, can new and 
exotic ‘particles’ arise? 

Phonons are the original examples of quasi- 
particles. They were introduced* conceptu- 
ally by Albert Einstein in 1907. Two years 
previously, he had proposed the idea that 
light has particle-like properties, being cre- 
ated and transmitted in discrete units, namely 
photons. Generalizing that intuition, he sug- 
gested that the vibrations of solids come in 
discrete packets, which we now call phonons. 
Einstein used this notion to explain an other- 
wise mysterious deficit of vibrational motion 
in diamond at low temperatures. Einstein's 
work pre-dated the modern understanding 
of solids (for example, the atomic nucleus was 
discovered only in 1911), but its central con- 
cepts endure. ‘Holes’ are another crucial kind 
of quasiparticle’. As their name suggests, holes 
represent the absence of an electron where 
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50 Years Ago 


With the increasing use of 
irrigation in horticulture and 
agriculture and the developing 
interest in the effects of 
evapotranspiration, it is frequently 
necessary to locate the level ofa 
shallow water-table. This can be 
done inexpensively and effectively, 
using a small-diameter observation 
well and an electric probe ... 
Twelve observation wells and a 
probe... have been satisfactorily 
used by me, in soils ranging from 
fine clay to coarse gravel, for a 
period of eighteen months, in 
connexion with investigations 

of the water-table in the Thames 
flood-plain. It is almost certain, 
however, that, if necessary, these 
instruments would give trouble- 
free service for a much longer 
period. 

From Nature 16 June 1962 


100 Years Ago 


A Reuter message from New York 
reports that a steamer arrived 

at Seward (Alaska) on Sunday 
covered with volcanic dust from 
an eruption at Katmai, in the 
Aleutian Islands. It is stated by 
those on board that a steady 
stream of volcanic fragments and 
ash followed a terrific explosion, 
spreading over the countryside. 
The sun was obscured. Although 
the vessel was seventy miles 
distant, at four oclock on Thursday 
afternoon complete darkness set in 
and ash fell in a thick layer on the 
decks. It is estimated that volcanic 
ash covers three hundred square 
miles of fertile country. According 
to a telegram from Seattle 
(Washington State) the volcanic 
disturbance is rendering wireless 
telegraphic communication with 
Kadiak, Rospberry, and Afognac, 
three of the most important 
islands of south-western Alaska, 
impossible. 

From Nature 13 June 1912 
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one would normally be expected. They come 
in discrete, localized units and move as if they 
are particles. 

Those two classic examples are good illus- 
trations of noteworthy peculiarities of quasi- 
particles. Phonons, although they arise from 
the correlated motion of atomic nuclei, are 
particle-like objects that in no way resemble 
atomic nuclei. Holes, despite their origin in 
the correlated motion of electrons, have very 
different properties from electrons; indeed, 
they have the opposite electric charge. Quasi- 
particles transcend the elementary units 
from which they are built. They are emergent 
objects, embodying recognizable — that is, 
reproducible and long-lived — organizations 
of energy within structured materials. 

Different materials can support different 
kinds of quasiparticles. Of course, each mater- 
ial’s quasiparticles depend on the structure of 
the material and exist only within that mater- 
ial. Modern physicists can exploit their broad 
understanding of matter to design materials 
whose quasiparticles will have interesting or 
useful properties. Transistors, the building 
blocks of modern electronics, are an outstand- 
ing, but by no means singular, success story. 
They orchestrate the motion of electrons and 
holes to amplify and switch electronic signals. 

Majorana modes add a striking new vari- 
ation to the quasiparticle theme’. Appearing 
as solutions to equations of a type invented* 
by Ettore Majorana in 1937, Majorana modes 
represent forms of excitation predicted — and 
now observed — to be available to a few very 
special and specific kinds of quasiparticles. 
They were discovered as mathematical pos- 
sibilities within quantum field theory’, and 
were also found in theoretical models of pos- 
sible exotic superconductors and of a particu- 
lar form of the quantum Hall effect'®, which 
occurs in two-dimensional electron systems 
held at low temperatures and subjected to 
strong magnetic fields. 

In a brilliant paper, Alexei Kitaev then 
showed’ how Majorana modes could arise at 
the ends of superconducting wires. His reason- 
ing was (relatively) simple and transparent, 
and is at the heart of most subsequent propos- 
als for the realization of Majorana modes — 
including those that underlie the experiments 
of Mourik et al. and of other researchers’ 
who have also reported evidence of the phe- 
nomenon in different but related systems. 

To become acquainted with Majorana 
modes, it may be helpful to draw on their anal- 
ogy to a more familiar property possessed by 
many particles and quasiparticles — their spin. 
For example, an electron at a given position 
can be in either of two states, with spin ‘up’ or 
‘down. Spin provides new dynamic options 
for particles that have it; that is, the possibil- 
ity of spin-dependent interactions, including 
interactions that change the spin’s direction. 
Majorana modes generate, for quasiparticles 
that support them, a kind of emergent spin. It 
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is convenient to have a word for the concept 
‘quasiparticle that supports Majorana modes, 
and I have suggested* ‘mode-icule’ To describe 
the quantum state of a mode-icule, we must 
specify a two-component wavefunction, just 
as we do for a (spin-ful) electron. 

Building on previous theoretical designs 
for detecting Majorana modes, Mourik et al. 
studied the electrical properties of indium 
antimonide nanowires lying atop a substrate 
made of a conventional superconducting 
material. Indium antimonide is a semiconduc- 
tor that displays a strong quantum interaction 
called spin-orbit coupling: the motion of the 
electrons in the material is strongly coupled 
to their spin. Interaction with the substrate’s 
electrons induces superconductivity in the 
wire’s electrons as well, through a phenom- 
enon known as the proximity effect. Mourik 
et al. observed that, in such a hybrid semicon- 
ductor-superconductor device, electrons can 
tunnel into and out of the wire with no change 
in energy, at an energy independent of applied 
voltage. This shows, quite directly, that there 
are unusual quantum states, associated with 
the ends of the wire, which have the properties 
predicted for Majorana ‘spins. The sensitivity 
of the strength of this tunnelling to the magni- 
tude and direction of applied magnetic fields 
also matches theoretical expectations for the 
states associated with Majorana modes. 

The analogy of Majorana modes to spin 
proves inadequate when we come to describe 
systems of several identical mode-icules. Cru- 
cially, the emergent ‘Majorana states attached 
to separate mode-icules, unlike ordinary spins, 
are not entirely independent. On the contrary, 
those emergent states are, unavoidably, highly 
entangled. As a result, the quantum state of 
2n mode-icules is described by a 2”-dimen- 
sional wavefunction, whereas 2n independ- 
ent spins would require 2*” dimensions"®. 
This constraint arises because mode-icules 
are entities called non-Abelian anyons”’, and 
so have unconventional quantum statistics — 
as opposed to fermions (such as electrons) or 
bosons (such as photons). 

The wavefunction’s dimensionality, 
although drastically reduced by entanglement, 
remains exponentially large in n. Interchange 
of mode-icules is accompanied by the com- 
plicated, but perfectly predictable, evolution 
of their wavefunctions in a complex math- 
ematical region known as Hilbert space. This 
evolution of wavefunctions in Hilbert space 
embodies the elegant principles of Clifford 
algebra’””®. 

Theorists have developed ingenious and 
ambitious proposals for exploiting mode- 
icules, with their controllable entanglement, 
to enable quantum computation**. Until 
recently, their visions have been, at best, 
loosely tethered to laboratory reality. To do 
useful computations, we will need to create 
many mode-icules, and to develop the abil- 
ity to move their ‘spins’ around one another. 


14,15 


Arrays of superconducting wires of the type 
developed by Mourik and colleagues' could 
each support mode-icules at their ends. By 
linking them together, and applying suitable 
electromagnetic fields, it ought to be possible 
to substantiate the theorists’ dreams. Much dif- 
ficult terrain separates proof of mode-icules’ 
existence from the promised land of complex, 
scalable mode-icule circuits, but a beachhead 
has been established. m 
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Reconstructing the 
third dimension 


An approach to microscopy has been developed that can be used to determine, 
from a single imaging angle, both the position of a specimen’s individual atoms in 
the plane of observation and the atoms’ vertical position. SEE LETTER P.243 


DILANO SALDIN 


report an electron-microscopy technique 

that draws on an analogy with the method 
used in astronomy for determining distances 
to galaxies. The technique allows high- 
resolution, three-dimensional information 
to be obtained about a sample using only one 
viewing direction. 

A central observation in astronomy is that 
distant galaxies are moving away from us, and 
from each other, with a speed that is propor- 
tional to their distance from Earth. In other 
words, the farther away they are, the faster they 
are moving. Because the speeds of galaxies can 
be measured from the Doppler effect, which 
shifts the galaxies’ light to the red end of the 
electromagnetic spectrum, their distances can 
be determined using the constant of propor- 
tionality between speed and distance, known 
as the Hubble constant’. This central obser- 
vation, called Hubble’s law, is crucial evidence 
for the now accepted view that the Universe 
originated in a Big Bang, as a tiny, unimagina- 
bly dense entity that has been expanding ever 
since. What’s more, it provides astronomers 
with a neat way of determining the distances to 
objects, which would otherwise be impossible 


|: this issue (page 243)’, Van Dyck and Chen 


using only the objects’ observed positions in 
the sky. 

The problem of calculating the distance 
of an object from the plane of observation 
(the image plane) also exists in other fields of 
research. One such field is electron micros- 
copy. Because an electron microscope pro- 
duces a two-dimensional image of a sample, 
the lateral positions of the sample's atoms can 
be observed directly to high precision. How- 
ever, as with most images recorded on two- 
dimensional media, including conventional 
cameras, the vertical positions (heights) of the 
specimen’s constituents in the direction per- 
pendicular to the image plane are not easily 
found. 

One way to determine these heights involves 
tomographic electron microscopy, in which a 
three-dimensional image is reconstructed 
from projected two-dimensional images 
obtained from a range of viewing direc- 
tions. But for imaging at atomic resolution, 
this method would require the sample to be 
mechanically tilted with sub-angstrém pre- 
cision over a large angular range, a feat that 
has not yet been achieved. Moreover, a typical 
electron microscope records only the inten- 
sity of the electron wave that is scattered by 
the sample. The wave's phase provides much 
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of the important structural information about 
the sample, but the phase at different image 
points is not ordinarily recorded on an electron 
micrograph. However, it can be found using a 
process known as focal series reconstruction*’, 
or by creating interference between the scat- 
tered wave and a known (unscattered) refer- 
ence wave, as is done in an imaging technique 
called electron holography’. 

Van Dyck and Chen suggest that it is pos- 
sible to calculate the heights of atoms from 
the image plane using only one viewing direc- 
tion, if this phase can also be determined. The 
authors point out that the phase speed — the 
rate at which the phase changes with scattering 
angle — near the image of a particular atom 
is approximately proportional to the height 
of that atom. This approximate proportional- 
ity then allows the height to be determined, 
in rough analogy to Hubble's law. However, 
it should be noted that the proportionality is 
strictly valid only in the vicinity of the atom’s 
projected position in the image plane, and that 
its value is approximate because of aberrations 
in the microscope’s lenses. Therefore, the local 
and approximate constant of proportionality 
allows only the height of that particular atom 
to be measured. 

In applications of holography that have 
captured the public’s imagination, a ghostly 
three-dimensional image of a macroscopic 
object is reconstructed from the information 
given by a two-dimensional interference pat- 
tern (the hologram) of two laser beams. In this 
form of imaging, which was first proposed® by 
the physicist Dennis Gabor to image micro- 
scopic objects using electron waves instead of 
light beams, the macroscopic object may be 
regarded as an ensemble of point scatterers, 
and the fact that fringes in the interference pat- 
tern overlap is no obstacle to correctly recon- 
structing the spatial positions of these point 
scatterers. In this technique, the three-dimen- 
sional nature of the image automatically gives 
information about the third dimension. This 
is also the basis of atomic-source holography’, 
which has been used, for example, to deter- 
mine the positions (including the heights) of 
atoms adsorbed ona surface. 

Van Dyck and Chen propose that knowl- 
edge about the third dimension can instead be 
obtained by first reconstructing the normally 
invisible phase of the electron wave, and then 
exploiting the analogy with Hubble’s law. They 
demonstrate their technique for atoms in a sys- 
tem composed of two layers of graphene — a 
one-atom-thick, honeycomb-like lattice of 
carbon. But for the proposed technique to be 
more generally applicable, it will be necessary to 
extend the algorithm to reconstruct truly three- 
dimensional objects, as Gabor envisaged. m 
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A life or death switch 


The identification of two receptors for salicylic acid reveals how the hormone 
controls cell death and survival during plant immune responses, in tissues close 
to and distant from the site of infection. SEE LETTER P.228 


ANDREA A. GUST & THORSTEN NURNBERGER 


inherent feature of multicellular organisms. 

In plants, immune responses are activated 
when cellular receptors recognize microbial 
proteins (effectors) that betray the invader to 
the plant’s surveillance system’. This activa- 
tion requires the plant hormone salicylic acid, 
which is produced on microbial attack’. But 
how plants detect the hormone, and how it 
performs its immunity-associated functions, 
has remained unclear. On page 228 of this 
issue, Fu et al.* report the identification of 
two salicylic acid receptors in the model plant 
Arabidopsis thaliana, and provide a fascinat- 
ing explanation of how the hormone controls 


[ieee to microbial infection is an 


Infected area 


both cell death at the site of infection, and 
cell survival and immune activation in 
non-infected tissues. 

In plants, effector-triggered immunity (ETI) 
is often accompanied by programmed cell 
death (PCD) at the infection site. In addition to 
participating in local immune responses, PCD 
triggers long-lasting immunity against a broad 
spectrum of microbes throughout the plant — 
a protective mechanism referred to as systemic 
acquired resistance’. Salicylic acid participates 
in these immune responses by controlling the 
movement of a protein called NPR1 (non- 
expresser of pathogenesis-related genes 1) 
from the cell cytoplasm to the nucleus*. Once 
in the nucleus, NPR1 regulates the expres- 
sion of plant defence genes. Because mutant 


Non-infected area 
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Y 
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Figure 1 | Salicylic acid-mediated control of plant cell death and survival. On microbial infection, 
levels of the plant hormone salicylic acid increase, with its concentration decreasing gradually with 
increasing distance from the site of infection. Fu et al.* show that at high salicylic acid concentrations 

— typically found in infected areas of the plant — the receptor NPR3, which binds salicylic acid with 

low affinity, mediates degradation of the cell-death suppressor NPR1 (left panel), thereby favouring 
programmed cell death and local effector-triggered immunity. However, at the lower salicylic acid 
concentrations typically found in cells distant from the infection site, salicylic acid cannot bind to the 
low-affinity receptor NPR3, so cell death is blocked. In these cells, salicylic acid instead binds to the high- 
affinity receptor NPR4 (right panel), blocking degradation of NPR1, and thereby favouring cell survival 
and the expression of genes associated with systemic immunity. 
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plants that are insensitive to salicylic acid and 
plants that lack NPR1 exhibit similar immune 
defects, NPR1 was previously proposed’ to be 
asalicylic acid receptor. However, Fu et al.‘ did 
not detect any physical interaction between 
salicylic acid and NPRI, suggesting that NPR1 
does not serve this receptor function. 

What, then, could be the bona fide salicylic 
acid receptor mediating local and systemic 
immune activation in plants? The research 
group presenting the current paper has previ- 
ously shown’ that the proper functioning of 
NPRI requires that the protein is broken down 
by cellular protein-degradation machinery 
called the proteasome. So Fu and colleagues 
hypothesized that adaptor proteins that link 
NPRI to the proteasome might be receptors 
for salicylic acid. Two members of the NPR 
protein family, NPR3 and NPR4, exhibit a pro- 
tein-domain structure that is characteristic of 
such adaptor proteins, leading the authors to 
surmise that these proteins could be the protea- 
some adaptors that mediate NPR1 degradation. 
To validate this assumption, the researchers 
demonstrated that NPR1 is degraded by the 
proteasome in wild-type A. thaliana plants, but 
not in plants in which the genes for NPR3 and 
NPR&4 have both been knocked out. 

Fuand colleagues used in vitro protein—pro- 
tein interaction studies to assess the effect of 
salicylic acid on the formation of protein com- 
plexes between NPRI and NPR3 or NPR4. 
The authors found, surprisingly, that salicylic 
acid promotes NPRI-NPR3 interaction, but 
disrupts formation of the NPR1-NPR4 com- 
plex. Thus it seems that salicylic acid interacts 
physically with NPR3 and NPR4 in a recep- 
tor-like manner, but that this interaction has 
opposing effects on the adaptor proteins inter- 
actions with NPR1. The authors also found 
that although NPR3 and NPR4 both bind to 
salicylic acid, NPR4 binds with greater affin- 
ity than does NPR3. Hence, Arabidopsis plants 
contain two salicylic acid receptors, NPR3 and 
NPR4, which differ in their affinity for the hor- 
mone and in their roles in NPRI degradation, 
with NPR3 mediating NPR1 breakdown only 
in the presence of salicylic acid and NPR4 
only in its absence. 

What are the biological consequences of 
NPR3- or NPR4-mediated degradation of 
NPRI1? Fu and colleagues found that both 
local PCD and local ETI responses to bacterial 
infection were compromised in plants lacking 
the genes encoding both NPR3 and NPR4. The 
impairment of PCD, combined with the fact 
that NPR1 accumulates in the mutant plants 
(because it cannot be degraded) suggests that 
NPRI suppresses PCD in wild-type plants. 
Because salicylic acid levels are highest at 
infection sites®, Fu et al. propose that binding 
of the hormone to the lower-affinity recep- 
tor NPR3 mediates NPR1 degradation and 
de-repression of PCD and ETI in infected 
cells (Fig. 1). 

However, infection causes salicylic acid 


levels to increase systemically as well as locally, 
with its concentration decreasing gradually 
with increasing distance from the infection 
site®. In cells farther away from the infected 
area, salicylic acid levels are likely to drop 
below the concentration required for NPR3- 
mediated NPR1 degradation and, thus, PCD. 
Fu and colleagues propose that, in these cells, 
salicylic acid binds instead to the higher- 
affinity receptor NPR4, which inhibits NPR4- 
mediated NPR1 degradation and thereby 
facilitates NPR1 accumulation, cell survival 
and subsequent salicylic acid-dependent gene 
expression (Fig. 1). Consistent with this model, 
the authors showed that NPR1 levels are lowest 
in cells undergoing PCD and highest in cells 
surrounding PCD lesions. 

Several mutant plants that exhibit runaway 
PCD have been identified’, and the question of 
how plants control PCD has been a major area 
of research. Fu and colleagues’ findings provide 
compelling evidence that salicylic acid acts as 
an immune signal to determine cell fate in plant 
immunity. Studies investigating plant proteins 
associated with abnormal PCD should now 
examine whether these proteins might con- 
tribute to the functionality of NPR3 or NPR4. 

Salicylic acid is the only major plant hor- 
mone for which the receptor has remained 
elusive. Fu and colleagues’ demonstration that 
the two salicylic acid receptors control dis- 
tinct defence strategies by de-repressing local 
cell death and immunity at the infection site 
in one case, and systemic immunity remote 
from the infection site in the other, is remi- 
niscent of the de-repression of physiological 
programs enacted by other plant hormones, 
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such as auxin, gibberellic acid and jasmonic 
acid’. However, NPR3 and NPR4 are the first 
plant-hormone sensors for which differences 
in binding affinity have been shown to medi- 
ate differential control of plant responses. 
Because most plant hormones regulate 
multiple aspects of plant life, it is certainly 
possible that other plant hormone receptors 
use a comparable mode of action. Consistent 
with this idea is the recent identification" of 
auxin hormone-binding proteins that have 
different ligand affinities, suggesting that 
plants also have means for differential sensing 
of auxin’. m 


Andrea A. Gust and Thorsten Niirnberger 
are in the Centre of Plant Molecular Biology, 
Department of Plant Biochemistry, University 
of Tiibingen, D-72076 Tiibingen, Germany. 
e-mails: nuernberger@uni-tuebingen.de; 
andrea. gust@zmbp.uni-tuebingen.de 


1. Dodds, P. N. & Rathjen, J. P. Nature Rev. Genet. 11, 
539-548 (2010). 

2. Jones, J. D. & Dangl, J. L. Nature 444, 323-329 
(2006). 

3. Spoel, S. H. & Dong, X. Nature Rev. Immunol. 12, 
89-100 (2012). 

4. Fu, Z.Q. et al. Nature 486, 228-232 (2012). 

5. Mou, Z., Fan, W. & Dong, X. Cel/ 113, 935-944 

(2003). 

6. Cao, H., Glazebrook, J., Clarke, J. D., Volko, S. & 

Dong, X. Cel! 88, 57-63 (1997). 

7. Spoel, S.H. et al. Cell 137, 860-872 (2009). 

8. Enyedi, A. J., Yalpani, N., Silverman, P. & Raskin, |. 
Proc. Natl Acad. Sci. USA 89, 2480-2484 (1992). 

9. Lam, E. Nature Rev. Mol. Cell Biol. 5, 305-315 

(2004). 

10.Robert-Seilaniantz, A., Grant, M. & Jones, J. D. Annu. 
Rev. Phytopathol. 49, 317-343 (2011). 

11.Calder6n Villalobos, L. |. et al. Nature Chem. Biol. 8, 
477-485 (2012). 


An infant giant 


Spectroscopic measurements of a galaxy that shines brightly at submillimetre 
wavelengths place it in the middle of a nascent galaxy cluster at a scant 
one billion years after the Big Bang. SEE LETTER P.233 


ALBERTO D. BOLATTO 


s in many scientific fields, but perhaps 
A= than in most, advances in tech- 

nology frequently drive progress in 
astronomy. The deployment of a new instru- 
ment or a capability can afford a completely 
different view of the Universe, opening a 
window onto an aspect of reality that was pre- 
viously unsuspected or only theorized. One 
such advance occurred in the late 1990s with 
the advent of large-format submillimetre- 
wave cameras — particularly SCUBA, the 
Submillimetre Common-User Bolometer 
Array mounted on the James Clerk Maxwell 
Telescope in Mauna Kea, Hawaii — and the 
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resulting discovery of a class of luminous 
yet elusive galaxies. On page 233 of this 
issue, Walter et al.’ describe an analysis that 
advances our understanding of this family of 
galaxies and closes a chapter on the story of 
their origins. 

Early on, images taken with SCUBA 
revealed a population of galaxies that shine 
brightly at submillimetre wavelengths, prosai- 
cally named submillimetre galaxies (SMGs). 
One of the first deep SCUBA images to 
uncover SMGs was obtained’ in 1998 through 
observations of the Hubble Deep Field (Fig. 1), 
which is perhaps the most emblematic patch 
of sky observed by the Hubble Space Tele- 
scope. Surprisingly, SMGs were very faint, or 
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even invisible, in optical images of 
this and other fields. Moreover, the 
SCUBA data provided no spectro- 
scopic information, and thus no 
direct knowledge of the objects’ 
distance or redshift, which are nec- 
essary to establish most of a galaxy’s 
properties. What were these myste- 
rious submillimetre sources? 

The poster child for this ques- 
tion is HDF 850.1, the brightest 
SMG in the Hubble Deep Field. 
Walter and colleagues determine 
the galaxy’s redshift, providing an 
important piece of the SMG puzzle. 
Analyses of the galaxies present in 
optical images at positions coinci- 
dent, within errors, with those of 
SCUBA sources suggested from the 
outset that these sources are located 
at cosmological distances from 
Earth, corresponding to a time 
when the Universe was a fraction 
ofits present age. The analyses also 
indicated that SMGs are tremen- 
dously powerful systems — with 
energy outputs of the order of 
10” times that of the Sun”’. Such 
luminosities imply that, if they are 
powered mostly by star forma- 
tion, SMGs are forming stars at 
rates several hundred times that 
of our Milky Way. Yet dust hides 
these starbursts from optical observations, 
which could therefore be missing an impor- 
tant fraction of the star-formation activity 
occurring in the early Universe’. 

Although there are indirect indications of 
how far away SMGs are located, spectroscopy 
of these sources — and so direct measurement 
of their redshifts — has remained difficult. 
One successful indirect technique relies on 
the correlation between an object's radio and 
far-infrared luminosities, and uses radio 
interferometers to pinpoint its precise sky 
coordinates. Measurement of the source’ red- 
shift then requires long-exposure spectroscopy 
of its optical counterparts using the largest 
optical telescopes. The results from studies 
based on this indirect method indicate that 
most SMGs exist at redshifts of about 1.5-3, 
around the peak of star-formation activity in 
the Universe’. Because of the built-in obser- 
vational selection effects of this technique, the 
question of how many SMGs lie at redshifts 
beyond 3 is still open. SMGs could shine in the 
submillimetre range at much greater distances 
than in the radio range, and so this approach 
might fail to detect distant SMGs. 

In their study, Walter and collaborators 
used the IRAM Plateau de Bure Interferom- 
eter near Montmaur, France, to obtain direct 
spectroscopic measurements of HDF 850.1, 
pinpointing its redshift at 5.183 — a scant 
1.1 billion years after the Big Bang. The tech- 
nique relies on detecting the spectral lines 
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Figure 1 | Structure in the infant Universe. The brightest source in this 
submillimetre-wavelength image of the Hubble Deep Field, obtained using 
the SCUBA camera’, is a galaxy called HDF 850.1. Walter and colleagues’ 
spectroscopic observations’ indicate that this object is at the centre ofa 
nascent galaxy cluster at redshift 5.183. White and yellow denote bright 
sources; red represents fainter sources. The circle’s diameter is about 

200 arcseconds (one-tenth of a full Moon). 


associated with two rotational transitions 
of the carbon monoxide (CO) molecule to 
unambiguously determine the redshift. The 
method has recently become feasible thanks 
to the increasing information-transmission 
speed and computing power of (sub) milli- 
metre-wave interferometric spectrographs, 
which can provide instantaneous spectra 
spanning tens of gigahertz. Determination 
of redshifts at millimetre and submillimetre 
wavelengths is a powerful tool, and will be used 
increasingly for the study of the early Universe 
with new advanced observatories, particularly 
the Atacama Large Millimetre Array in Chiles 
Atacama Desert. 

The authors’ spectroscopic analysis also 
revealed a spectral line associated with ionized 
carbon. This detection not only confirms the 
SMG redshift, but also supports the idea that 
the radiation previously detected by SCUBA 
stems from a monstrous starburst producing 
stars 800 times faster than the present-day 
Milky Way. Yet the source is completely invis- 
ible in the deepest optical images, in which 
the radiation from young stars is not detected 
because of obscuration by dust present in the 
source. This obscuring dust absorbs ultraviolet 
and optical light and re-emits it at far-infrared 
wavelengths, which are subsequently ‘red- 
shifted’ (stretched) by the expansion of the 
Universe and so fall in the submillimetre band 
observed by SCUBA. 

The authors find that the abundance of 
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dust relative to gas in HDF 850.1 is 
similar to that found in the Milky 
Way. Dust is predominantly com- 
posed of heavy elements (elements 
other than hydrogen and helium), 
most of which originate from stel- 
lar explosions known as superno- 
vae, which are caused by the death 
of massive stars. The unexpectedly 
high abundance of dust relative to 
gas at this early cosmic epoch indi- 
cates that heavy-element enrich- 
ment of the interstellar medium 
of HDF 850.1 proceeded at an 
extremely fast pace. The mecha- 
nism responsible for creating this 
much dust so quickly is not entirely 
understood. 

A remarkable finding is that 
HDF 850.1 is not alone. It is sitting 
in the middle of a clear ‘overdensity’ 
of sources that also includes a rare 
quasar (an active galactic nucleus 
powered by a supermassive black 
hole). Twelve other sources have 
been identified within a redshift 
interval of about 0.03 from the 
galaxy’s location. This interval cor- 
responds to a local depth in the sky 
of about 2.5 megaparsecs (8 mil- 
lion light years), which is also the 
approximate extent of the structure 
in the sky. This overdensity is one of 
two’ spectroscopically verified candidate pro- 
toclusters at such high redshift, and thus repre- 
sents one of the earliest structures known in the 
Universe — occurring after the epoch at which 
relic radiation from the Big Bang, known as 
the cosmic microwave background radia- 
tion, emerged. This discovery lends further 
support to the idea that the most luminous 
SMGs are signposts for rare overdensities in 
the overall structure of the Universe. Such 
extremely overdense regions are thought 
to have evolved into present-day massive 
clusters, and the brightest SMGs in these 
regions probably became today’s giant ellip- 
tical galaxies at the centres of the clusters’ 
potential well°. The conclusion of the story 
for HDF 850.1, then, is that this SMG is 
nothing less than an infant giant undergoing 
a growth spurt. = 
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Visualizing heavy fermions emerging in a 
quantum critical Kondo lattice 


Pegor Aynajian'*, Eduardo H. da Silva Neto!*, Andras Gyenis', Ryan E. Baumbach?, J. D. Thompson’, Zachary Fisk*, Eric D. Bauer? 


& Ali Yazdani! 


In solids containing elements with f orbitals, the interaction between f-electron spins and those of itinerant electrons leads 
to the development of low-energy fermionic excitations with a heavy effective mass. These excitations are fundamental to 
the appearance of unconventional superconductivity and non-Fermi-liquid behaviour observed in actinide- and 
lanthanide-based compounds. Here we use spectroscopic mapping with the scanning tunnelling microscope to detect 
the emergence of heavy excitations with lowering of temperature in a prototypical family of cerium-based heavy-fermion 
compounds. We demonstrate the sensitivity of the tunnelling process to the composite nature of these heavy quasiparticles, 
which arises from quantum entanglement of itinerant conduction and f electrons. Scattering and interference of the 
composite quasiparticles is used to resolve their energy-momentum structure and to extract their mass enhancement, 
which develops with decreasing temperature. The lifetime of the emergent heavy quasiparticles reveals signatures of 
enhanced scattering and their spectral lineshape shows evidence of energy-temperature scaling. These findings 
demonstrate that proximity to a quantum critical point results in critical damping of the emergent heavy excitation of 


our Kondo lattice system. 


A local magnetic moment occurs when a strongly interacting 
quantum state, such as an atomic d or f orbital, cannot be doubly 
occupied owing to strong on-site Coulomb repulsion’. In the presence 
of a dilute concentration of such magnetic moments in a metal, spin- 
flip scattering of conduction electrons from these local moments 
results in their collective magnetic screening below a characteristic 
temperature called the Kondo temperature, Tx (Fig. 1a)’. In materials 
where local moments are arranged in a dense periodic array, the 
so-called Kondo lattice, the deconfinement of localized orbitals through 
their hybridization with the conduction electrons results in composite 
low-energy excitations with a heavy effective mass (Fig. 1b). Tuning the 
hybridization between f orbitals and itinerant electrons can destabilize 
the heavy Fermi-liquid state towards an antiferromagnetically ordered 
ground state at a quantum critical point (QCP)**. In proximity to such 
a quantum phase transition—between itinerancy and localization of f 
electrons—many heavy-fermion systems exhibit unconventional 
superconductivity at low temperatures (Fig. 1c)’. 

Thermodynamic and transport studies have long provided evidence 
for heavy quasiparticles, their unconventional superconductivity and 
non-Fermi-liquid behaviour in a variety of material systems”™. 
However, the emergence of a coherent band of heavy quasiparticles near 
the Fermi energy in a Kondo lattice system is still not well understood’*"”. 
Part of the challenge has been the inability of spectroscopic measure- 
ments to probe the development of heavy quasiparticles with lowering of 
temperature and to characterize their properties with high energy reso- 
lution. Such precise measurements of heavy-fermion formation are not 
only required for understanding the nature of these electronic excitations 
close to quantum phase transitions’’, but are also critical to identifying 
the source of unconventional superconductivity near such transitions. 


Composite heavy-fermion excitations 
The emergence of composite heavy fermions in a Kondo lattice can be 
considered as a result of the hybridization of two electronic bands: one 
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Figure 1 | Tunnelling into a Kondo lattice. a, b, Schematic representations ofa 
single-impurity Kondo effect (a) and a Kondo lattice (b), illustrating the screening 
(hybridization) of the local moments (red arrows) by the itinerant conduction 
electrons (green arrows). c, Schematic phase diagram of heavy-fermion systems 
where the electronic ground state can be tuned from antiferromagnetism (AFM) 
with localized f moments to a heavy Fermi liquid (HFL) with itinerant f electrons. 
At low temperatures, superconductivity (SC) sets in near the quantum critical 
point (QCP) from a non-Fermi liquid (NFL). Insets show cartoon pictures of the 
spin fluctuations in the different phases. d, Bare electronic bands (dashed lines; 
spd and f) and hybridized heavy-fermion bands (HF; solid lines) displaying a 
direct (2v) and an indirect (4),) hybridization gap. e, Tunnelling spectrum 
computed from the hybridized band structure in d for a tunnelling ratio t;/ 

t. = —0.025, showing sensitivity to the direct hybridization gap (2v). f, Similar 
spectra computed with f;/t, = —0.37, showing sensitivity to the indirect gap (4)). 
See Supplementary Information section I for details of the model. 
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dispersing band due to conduction electrons and one weakly dispers- 
ing band originating from localized f electrons (dashed lines in 
Fig. 1d). This hybridization generates low-energy quasiparticles that 
are a mixture of conduction electrons and f electrons with a modified 
band structure characterized by the so-called direct (2v) and indirect 
(4),) hybridization gaps, as shown in Fig. 1d‘”’’. Various theoretical 
approaches, including several numerical studies, reproduce the generic 
composite band structure shown in Fig. 1d”°**. Recent theoretical 
modelling has also shown that tunnelling spectroscopy can be a 
powerful probe of this composite nature of heavy fermions*>”*. 
Depending on the relative tunnelling amplitudes to the light conduction 
(t,) or to the heavy f-like (t) components of the composite quasiparticles, 
and due to their interference, tunnelling spectroscopy can be sensitive to 
different features of the hybridized band structure. Figure 1d—-f shows 
examples of model calculations (see Supplementary Information section 
I) illustrating the sensitivity of the spectra to predominant tunnelling to 
the light (Fig. le) or heavy (Fig. 1f) electronic states. 

Recent advances in the application of scanning tunnelling micro- 
scopy (STM) to heavy fermions are providing a new approach to 
examining the correlated electrons in these systems with high energy 
and spatial resolutions. STM and point-contact experiments on 
heavy-fermion compounds have shown evidence for hybridization 
of the conduction electrons with the f orbitals and have been used 
to probe the so-called hidden order phase transition involving heavy f 
electrons in URu,Si, (refs 29-32). Sudden onset of the hidden order 
phase seems to give rise to strong modification of the band structure in 
URu,Siz as detected by STM measurements*”*'. However, these 
changes are correlated with the phase transition into the hidden order 
at 17.5 K rather than being the generic physics of heavy Fermi liquids 
that should appear at higher temperatures and evolve smoothly 
with lowering of temperature. Direct experimental observation of 
the gradual formation of heavy quasiparticles with decreasing tem- 
perature and evidence of their composite nature, which is ubiquitous 
to all heavy fermions, as well as examination of their properties in 
proximity to QCPs, have remained out of the reach of STM and other 
spectroscopic measurements. 


CeMIns as a model heavy-fermion system 

To provide a controlled study of the emergence of heavy-fermion 
excitations within a Kondo lattice system that can be tuned close to 
a QCP, we carried out studies on the CeMIn; (with M = Co, Rh) 
material system. These so-called 115 compounds (the chemical 
formula could be written Ce,M,In;) offer the possibility of tuning 
the interaction between the f orbitals of Ce and the itinerant spd 
conduction electrons using isovalent substitutions at the transition 
metal site within the same tetragonal crystal structure. Consequently, 
the ground state of this system can be tuned (in stoichiometric com- 
pounds) between antiferromagnetism, as in CeRhIn; (Néel temper- 
ature Ty = 3.5K), to superconductivity, as observed in CeColIns 
(superconducting transition temperature T,= 2.3K) and CelrIn, 
(T- = 0.4K)’. Previous studies indicate that CeColns is very close to 
a QCP***°, whereas CeRhIn, can be tuned close to this transition with 
application of pressure”*’. These experiments confirm that super- 
conductivity in the 115 system emerges at low temperatures close to 
a QCP from heavy low-energy excitations that developed at high 
temperature””*®. More specifically, transport studies show a drop in 
the electrical resistivity of CeCoIn; around 50K (which has been 
interpreted as evidence for the development of a coherent heavy 
quasiparticle band) followed by a T-linear resistivity at lower temper- 
ature (above T.)** — a behaviour that has been associated with the 
proximity to the QCP. Quantum oscillations and thermodynamic 
measurements find a heavy electron effective mass (10-50 mo, where 
Mp is the bare electron mass) for CeCoIns, whereas in the same tem- 
perature range the f electrons in CeRhIn; are effectively decoupled 
from the conduction electrons*”°. 
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Figure 2 shows STM images of a single crystal of CeColIn; that has 
been cleaved in situ in our variable temperature ultrahigh-vacuum 
STM. In this family of compounds, the cleaving process results in the 
exposure of multiple surfaces terminated with different chemical 
compositions. The crystal symmetry necessarily requires multiple 
surfaces for cleaved samples, as no two equivalent consecutive layers 
occur within the unit cell. Therefore breaking of any single chemical 
bond will result in different layer terminations on the two sides of the 
cleaved sample. Experiments on many cleaved samples have revealed 
three different surfaces, two of which are atomically ordered (termed 
surfaces A and B in Fig. 2a, b) with a periodicity of ~4.6 A corres- 
ponding to the lattice constant of the bulk crystal structure, whereas 
the third surface (termed surface C, Fig. 2b) is reconstructed. 
Comparison of the relative heights of the sub-unit-cell steps between 
the different layers (Fig. 2c, d) to the crystal structure determined 
from scattering experiments*' suggests that exposed surfaces A, B 
and C correspond to the Ce-In, Co and In, layers, respectively. 
Experiments on CeCo(Ino 99g5Hgo,0015)5 and CeRhIn, reveal similar 
results, where cleaving exposes the corresponding multiple layers in 
those compounds (see Supplementary Information section II). Hg 
defects in CeColn; at this concentration have negligible influence 
on its thermodynamic and transport properties and are introduced 
for the scattering experiments described below”. 


Signatures of hybridization and composite excitations 

Spectroscopic measurements of CeColIn; show the sensitivity of the 
tunnelling process to the composite nature of the hybridized heavy- 
fermion states. Tunnelling spectra on surface A (identified as the 
Ce-In layer) of CeCo(Inp99gsHgo.0015)5 show that on cooling the 
sample dramatic changes develop in the spectra in an asymmetric 
fashion about the Fermi energy (Fig. 3a). (The same behaviour is also 
observed in CeCoIns; see Supplementary Information section III.) 


Surface A Surface C 


Surface B 


700. 200 300 
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Figure 2 | STM topographies on CeCoIn;. a, Constant current topographic 
image (+200 mV, 200 pA) showing an atomically ordered surface (termed 
surface A) with a lattice constant of ~4.6 A.b, Topographic image (—200 mV, 
200 pA) showing two consecutive layers: a distinct atomically ordered surface 
(termed surface B, lattice constant ~4.6 A, dark blue) and a reconstructed 
surface (termed surface C, light blue). Insets in a and b show magnified images 
(45 X 45 A?) of the three different surfaces. c, Constant current topographic 
image (— 150 mV, 365 pA) displaying all three surfaces (the derivative of the 
topography is shown to enhance contrast). d, A line section through the 
different surfaces (solid line in c) showing the relative step heights (left) 
compared to the bulk crystal structure (right). 
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Figure 3 | Composite nature of heavy-fermion excitations. a Averaged 
tunnelling spectra (—150 mV, 200 pA) measured on surface A of 

CeCo(Ino 99gsHo.0015)s for different temperatures (T, in K; solid lines) and on 
the corresponding surface A of CeRhIn; at 20 K (dashed line). b, Averaged 
tunnelling spectra (—150 mV, 200 pA) measured on surface B of 

CeCo(Ino 99g5Ho.0015)s for different temperatures (T, in K; solid lines) and on 
corresponding surface B of CeRhIns at 20 K (dashed line). ¢, d, Tunnelling 
spectra computed for t/t, = —0.01 (c) and t;/t. = —0.20 (d) for selected values 
of yf (in meV; solid lines). See Supplementary Information section I for details 
of the model. 


The redistribution of the spectra observed on this surface is consistent 
with a tunnelling process that is dominated by coupling to the light 
conduction electrons and displays signatures of the direct hybridiza- 
tion gap (2v) experienced by this component of the heavy-fermion 
excitations (for example, see Fig. 1d, e). In contrast to these observa- 
tions, similar measurements on the corresponding surface of CeRhIn; 
show spectra that are featureless in the same temperature range 
(Fig. 3a, dashed line) and are consistent with the more localized nature 
of the Ce f orbitals in CeRhIn; as compared to CeColn;. The hybrid- 
ization gap structure in CeColn; is also centred above the chemical 
potential (8 meV, see Fig. 3a), which makes access difficult for angle- 
resolved photoemission experiments** *—the typical technique used 
for probing electronic band structure in solids. 

The composite nature of the heavy-fermion excitations manifests 
itself by displaying different spectroscopic characteristics for tunnelling 
into the different atomic layers. Figure 3b shows spectra measured on 
surface B (identified as Co) of CeCo(Ino908sHgo.0015)s that look very 
different from those measured on surface A (Fig. 3a). In the temperature 
range where spectra on surface A (Fig. 3a) develop a depletion of 
spectral weight near the Fermi energy, surface B shows a sharp enhance- 
ment of spectral weight within the same energy window (Fig. 3b). With 
further lowering of temperature, the enhanced tunnelling on surface B 
evolves into a double-peak structure. As a control experiment, measure- 
ments on the corresponding surface in CeRhIns, once again, display 
no sharp features in the same temperature and energy windows 
(Fig. 3b, dashed line). The spectroscopic features of surface B of 
CeCo(Ino 908sH¥o.0015)s display the characteristic signatures of 
dominant tunnelling to the f component of the heavy quasiparticles, 
which reside near the Fermi energy and are expected to display the 
indirect hybridization gap (4),; see Fig. 1d, f). 

Modelling the tunnelling to composite heavy excitations can repro- 
duce our spectroscopic measurements on the two different atomically 
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ordered surfaces of CeCo(Ino99gsHgo.o01s)s. Following recent 
theoretical efforts***’, we compute spectroscopic properties of a 
model band structure in which a single hole-like itinerant band of 
spd-like electrons hybridizes with a narrow band of f-like electrons 
(see Supplementary Information section I for details of the model). 
The results of our calculations (Fig. 3c, d) are sensitive to the ratio of 
tunnelling (t//t.) into the heavy f states to tunnelling into the light 
conduction band—a behaviour that explains the differences between 
the tunnelling processes on the different cleaved surfaces (Fig. 3a, b). 
Although naively one would expect that tunnelling to the heavy 
excitations would be more pronounced on the Ce-In layer, recent 
first principles calculations show that the amplitude of the hybridiza- 
tion of the f states with the out-of-plane spd electrons can be remarkably 
larger than the amplitude of the hybridization with the in-plane spd 
electrons”'. 


Visualizing quasiparticle mass enhancement 


To directly probe the energy-momentum structure of heavy 
quasiparticles in the 115 material systems, we have carried out spec- 
troscopic mapping with the STM that enables us to visualize the 
scattering and interference of these quasiparticle excitations from 
impurities or structural defects. Elastic scattering of quasiparticles 
from these imperfections gives rise to standing waves in the conduc- 
tance maps at wavelengths corresponding to 2n/q, where q = k; — k; 
is the momentum transfer between initial (k,) and final (k,) states at 
the same energy. We expect that those q with the strongest intensity 
connect regions of high density of states on the contours of constant 
energy, and hence provide energy-momentum information about 
the quasiparticle excitations. We characterize the scattering q using 
discrete Fourier transforms (DFTs) of STM conductance maps 
measured at different energies. The presence of Hg substitutions in 
CeCo(Ino 998sH0.0015)s provides a sufficient number of scattering 
centres to enhance signal to noise ratio for such quasiparticle 
interference (QPI) measurements. 

Figure 4a shows examples of energy-resolved STM conductance 
maps on surface A of CeCo(Ino 99gsHo.0015)5 measured at 20 K; the 
maps display signatures of scattering and interference of quasiparticles 
from defects and step edges. These conductance maps show clear 
changes of the wavelength of the modulations as a function of energy. 
Perhaps the most noticeable are the changes around each random 
defect (see Supplementary Information section II for the correspond- 
ing STM image showing the location of the Hg defects). Figure 4b 
shows DFTs of such maps; sharp non-dispersive Bragg peaks (at the 
corners, (+2n/a, 0), (0, +21/a)) corresponding to the atomic lattice are 
seen, as well as other features (concentric square-like shapes) that 
rapidly disperse with energy, collapse (Fig. 4b; 0 meV) and then dis- 
appear (Fig. 4b; 9 meV) near the Fermi energy. We have carried out 
such measurements both at low temperatures (20 K, Fig. 4b), where the 
spectrum shows signatures of hybridization between conduction elec- 
trons and f orbitals, and at high temperatures (70K, Fig. 4c), where 
such features are considerably weakened (for example, Fig. 4c; 2 meV, 
10 meV). As a control experiment, we have also carried out the same 
measurements on the corresponding surface of CeRhIns (Fig. 4d), for 
which signatures of heavy electron behaviour are absent (for example, 
Fig. 3a) in the same temperature window (20K). Although 
understanding details of the QPI in Fig. 4 requires detailed modelling 
of the band structure of the 115 compounds, the square-like patterns 
observed in the data correspond to scattering wavevectors that can be 
identified from the calculated local-density approximation (LDA) 
band structure*® (see Supplementary Information section V). 

We find that analysing the features of the energy-resolved DFT 
maps provides direct evidence for mass enhancement of quasiparticles, 
in unison with related signatures in the tunnelling spectra. Figure 5a 
and b shows line sections of the DFT maps plotted along two high- 
symmetry directions (the thick white lines in Fig. 4b) as a function of 
energy for CeCo(Ing 99g5Hgo.0015)5 at 20 K, and in Fig. 5c we show their 
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Figure 4 | Spectroscopic mapping of quasiparticle interference (QPI). 

a, b, Real space (a) and corresponding DFT (b) of conductance maps 

(—200 mV, 1.6 nA) at selected energies (top left of each panel) measured on 
surface A of CeCo(Ino 99gsHgo.o015)s at 20 K. ¢, d, Similar DFTs for 

CeCo(Ino 99gsHgo.0015)5 at 70 K (—150 mV, 1.5 nA; c) and on the corresponding 
surface A for CeRhIns at 20 K (—200 mV, 3.0 nA; d) at selected energies. 
Arrows indicate the position of the Bragg peaks at (27/a, 0) and (0, 2n/a). All 
DFTs were four-fold symmetrized (due to the four-fold crystal symmetry) to 
enhance resolution (see Supplementary Information section IV). The intensity 
is represented on a linear scale. PSD, power spectrum density. 


corresponding spatially averaged spectrum. The square-like regions of 
enhanced quasiparticle scattering in Fig. 4b appear in the line sections 
of Fig. 5a, b as energy-dependent bands of scattering, which become 
strongly energy dependent near the Fermi energy. Clearly the scatter- 
ing of the quasiparticle excitations in the energy window near the 
direct hybridization gap has a flatter energy-momentum structure 
than that at energies away from the gap. This is a direct signature of 
the quasiparticles acquiring heavy effective mass at low energies near 
the Fermi energy. Detailed analysis of one of the QPI bands estimates 
the mass enhancement near the Fermi energy to be about 30m (Fig. 5a 
inset), a value which is close to that seen in quantum oscillation studies 
of CeColn; (refs 39, 40). Our model calculation, which describes the 
spectroscopic lineshapes on the different surfaces, can also be 
extended to reproduce the signatures of mass enhancement in the 
QPI data (see Supplementary Information section I). 

Contrasting low temperature QPI patterns on CeCo(Ino 99gsHo .0015)5 
to measurements on the same compound at high temperatures (70 K, 
Fig. 5d, e), where the hybridization gap is weak (Fig. 5f), or to measure- 
ments on CeRhIns (20K, Fig. 5g, h), where signatures of a hybridiza- 
tion gap are absent in the tunnelling spectra (Fig. 5i), confirms that 
the development of this gap results in apparent splitting of the bands 
and is responsible for both the scattering and the heavy effective 
mass in the QPI measurements. Furthermore, these measurements 
show that the underlying band structure responsible for the scattering 
wavevectors away from the Fermi energy is relatively similar between 
CeCo(Ino 99gsHgo.0015)5 and CeRhIn;. Only when f electrons of the 
Kondo lattice begin to strongly hybridize with conduction electrons 
and modify the band structure within a relatively narrow energy 
window (30 meV) do we see signatures of heavy-fermion excitations 
in QPI measurements, signalling a transition from a small to a large 
Fermi surface (see Supplementary Information section V). 
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Figure 5 | Visualizing quasiparticle mass enhancement. a, b, Energy- 
momentum structure of the QPI bands extracted from line sections (solid white 
lines in Fig. 4b) along the atomic direction (277/a, 0) (a) and along the zone 
diagonal (n/a, t/a) (b) in CeCo(Ing 99gsHgo.0015)5 at 20 K. The solid red line 
represents a fourth-order polynomial fit to the data. Inset in a shows the 
effective mass m*/mpo as a function of momenta obtained from the curvature 
(A (d?E/dq?) ) of the outer band (solid red line in a). ¢, Average spectrum 
on surface A of CeCo(Ino 99gsHgo o015)s at 20 K, reflecting the suppression of 
scattering in the QPI bands. Similar measurements performed in 

CeCo(Ino 99gsH¥o.0015)5 at 70 K (d-f) and in CeRhIns at 20 K (g-i). The 
intensity is represented on a linear scale. 


Signatures of quantum criticality 

The ability to tunnel through the fcomponent of the heavy quasiparticles 
on surface B of CeCo(Ing 99gsHo.0015)5 provides an opportunity to 
probe the lifetime of the heavy quasiparticles as a function of temper- 
ature in a system that is close to a QCP. The narrow dispersion of the f 
band results in a direct connection between the experimentally 
measured width of the peak in the density of states near the Fermi 
energy (Fig. 3b) and the lifetime of the heavy quasiparticles. Analysis 
of this width measured at different temperatures is displayed in Fig. 6a 
(see Supplementary Information section VI), and shows a strong 
temperature dependence with a finite intercept (~3.5 meV) in the 
limit of zero temperature. The finite width at zero temperature can be 
understood as a consequence of a small but finite dispersion of the f 
band as well as a finite probability of tunnelling into the spd electrons 
(see Supplementary Information section I). However, the large linear 
slope in Fig. 6a (that is, larger than 3/2 kgT, where kg is Boltzmann’s 
constant and T is temperature) indicates that the lifetime of the f 
electrons, as opposed to thermal broadening, is strongly influencing 
the spectra and its temperature dependence. Consistent with this 
observation, we also find that to capture the temperature evolution 
of the spectra in Fig. 3b, we have to use rather large values of scattering 
rate (inverse lifetime) of the f component of the heavy quasiparticles, 
Y= h/t, (where hi is Planck’s constant h divided by 21 and 1; is the 
lifetime of the quasiparticles), in our model calculations (Fig. 3d). 

A T-linear scattering rate (or inverse lifetime) for the heavy 
quasiparticles is consistent with the expectation that CeCoIn; is close 
to a QCP, because for systems tuned close to such transitions, 
temperature is the only relevant energy scale available to determine 
the quasiparticle lifetime, resulting in h/t, « kT (refs 18, 47). 
However, a more precise signature of a QCP would be the observation 


©2012 Macmillan Publishers Limited. All rights reserved 


a 30 + + . 
oO FWHM 
o Thermally deconvoluted 
FWHM 
— 3/2k,T 
20 
S 4 
oO 
S } 
: ? 
a0 
é 2 ; 
10 $ h 
h 
0 
0 20 40 60 


Temperature (K) 


(di/dV) (kT 9%) 


Figure 6 | Signatures of quantum criticality. a, Full width at half maximum 
(FWHM) of the heavy quasiparticle peak (red squares) as a function of 
temperature extracted from a Gaussian fit to the sharp lineshape of the spectra 
of Fig. 3b after a smooth background subtraction (see Supplementary 
Information section VI). Blue squares represent the thermally deconvoluted 
FWHM corresponding to the intrinsic width in the absence of thermal 
broadening. The green line represents 3/2k,T. Error bars, 1 s.d. b, Energy- 
temperature scaling of the different spectra of Fig. 3b, after the removal of a 
temperature independent background (see Supplementary Information 
sections VI and VII), within a narrow energy window near the Fermi energy. 
Key shows temperature T in K. Inset shows the ‘goodness of the collapse’ 
(colour scale) as a function of the critical exponents « and f. 


of energy-temperature scaling of experimental quantities near such 
transitions. In fact, recent theoretical work suggests that the instability 
of the Fermi surface near a QCP should result in scaling properties of 
the single-particle excitation that can be directly probed in measure- 
ments of the tunnelling density of states**. To test this hypothesis, we 
examine the lineshape of the tunnelling spectra on surface B of 
CeCo(Ino 908sHgo.0015)s near the chemical potential at different 
temperatures, and attempt to scale the data (Fig. 3b) by plotting 
(dI/dV)<(kgI”) as a function of E/(kgT)’. (Here (dI/dV)s is the 
background-subtracted spectra of Fig. 3b (see Supplementary Informa- 
tion section VI) and E is the energy of the tunnelling quasiparticles.) 
We find that using the exponents « = 0.53 and f/f =1 results in a 
collapse of the data at different temperatures on a single curve in the 
low bias region (see Fig. 6b and Supplementary Information section 
VII). Although an understanding of the value of the exponent « is 
currently lacking, the linear power f confirms our hypothesis of 
energy-temperature scaling associated with proximity to a QCP. 
These results indicate that the heavy quasiparticles in CeColns are 
damped because of critical fluctuations rather than the typical scatter- 
ing that is expected in a Fermi liquid (T° dependence). Similar energy— 
temperature scalings, with anomalous exponents (x), have been 
previously observed in the dynamical spin susceptibility of other 
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heavy-fermion systems near QCPs****°°. However, here we show for 
the first time that the signatures of scaling and critical phenomena 
appear in the spectroscopic properties of the quasiparticle excitations. 


Conclusion and outlook 


The experimental results and the model calculations presented here 
provide a comprehensive picture of how heavy-fermion excitations in 
the 115 Ce-based Kondo lattice systems emerge with lowering of 
temperature or as a result of chemical tuning of the interaction 
between the Ce f electrons and the conduction electrons. The changes 
in the scattering properties of the quasiparticles directly signal the 
flattening of their energy-momentum structure and the emergence 
of heavy quasiparticles near the Fermi energy. Such changes are also 
consistent with the predicted evolution from a small to a large Fermi 
surface as the localized f electrons hybridize with the conduction 
electrons. The sensitivity of the tunnelling to the surface termination 
and the successful modelling of these data provide direct spectro- 
scopic evidence of the composite nature of heavy fermions and offer 
a unique method to disentangle their components. 

Our experiments also demonstrate that the emergent heavy 
quasiparticles in our system are strongly scattered and show signatures 
of scaling associated with critical damping of excitations in proximity 
to a QCP. Like many other heavy-fermion systems, thermodynamic 
and transport studies of the 115 systems have shown evidence of 
quantum criticality, but such signatures have not been previously 
isolated in an electron spectroscopy measurement, as described here. 
Such spectroscopic signatures are direct evidence for the breakdown 
of coherent fermionic excitations approaching a QCP. Future 
extension of our measurements to lower temperatures could probe 
the interplay between quantum fluctuations and the appearance of 
superconductivity, an issue which continues to be one of the most 
debated in condensed matter physics. 


METHODS SUMMARY 


The single crystals of CeCoIn;, CeCo(Ing 99gsHo.o015)5 and CeRhIns used for this 
study were grown from excess indium at Los Alamos National Laboratory. Small, 
flat crystals were oriented along the crystallographic axes and cut into sizes 
suitable for STM measurements (~2 X 2 X 0.2 mm*). The samples were cleaved 
on a surface perpendicular to the c axis in ultrahigh vacuum (UHV) and trans- 
ferred in situ to the microscope head. Differential conductance (dI/dV) measure- 
ments were performed using standard lock-in techniques. Approximately ten 
different samples of CeCoIns, CeCo(Ino.99gsHgp.0015)s and CeRhIns were success- 
fully cleaved and studied, and the spectroscopic data collected were reproducible 
on the corresponding identical exposed surfaces of the different samples. Spectra 
measured at different locations on each surface showed negligible variations. The 
spectra presented here (in the main paper) are averaged over approximately 200 
individual spectra measured over an area of at least 100 A X 100 A. The spectro- 
scopic lineshapes showed negligible variations as the tip height was varied (vari- 
ation of the tunnelling current by two orders of magnitude). 
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Structure, function and diversity of the 
healthy human microbiome 


The Human Microbiome Project Consortium* 


Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that 
occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, 
environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the 
ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort 
and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat’s 
signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among 
individuals. The project encountered an estimated 81-99% of the genera, enzyme families and community 
configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was 
stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of 
the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range 
of structural and functional configurations normal in the microbial communities of a healthy population, enabling future 


characterization of the epidemiology, ecology and translational applications of the human microbiome. 


A total of 4,788 specimens from 242 screened and phenotyped adults’ 
(129 males, 113 females) were available for this study, representing the 
majority of the target Human Microbiome Project (HMP) cohort of 
300 individuals. Adult subjects lacking evidence of disease were 
recruited based on a lengthy list of exclusion criteria; we will refer 
to them here as ‘healthy’, as defined by the consortium clinical 
sampling criteria (K. Aagaard et al., manuscript submitted). 
Women were sampled at 18 body habitats, men at 15 (excluding three 
vaginal sites), distributed among five major body areas. Nine specimens 
were collected from the oral cavity and oropharynx: saliva; buccal 
mucosa (cheek), keratinized gingiva (gums), palate, tonsils, throat 
and tongue soft tissues, and supra- and subgingival dental plaque (tooth 
biofilm above and below the gum). Four skin specimens were collected 
from the two retroauricular creases (behind each ear) and the two 
antecubital fossae (inner elbows), and one specimen for the anterior 
nares (nostrils). A self-collected stool specimen represented the micro- 
biota of the lower gastrointestinal tract, and three vaginal specimens 
were collected from the vaginal introitus, midpoint and posterior 
fornix. To evaluate within-subject stability of the microbiome, 131 
individuals in these data were sampled at an additional time point 
(mean 219 days and s.d. 69 days after first sampling, range 35-404 days). 
After quality control, these specimens were used for 16S rRNA gene 
analysis via 454 pyrosequencing (abbreviated henceforth as 16S profil- 
ing, mean 5,408 and s.d. 4,605 filtered sequences per sample); to assess 
function, 681 samples were sequenced using paired-end Illumina 
shotgun metagenomic reads (mean 2.9 gigabases (Gb) and s.d. 2.1 Gb 
per sample)’. More details on data generation are provided in related 
HMP publications’ and in Supplementary Methods. 


Microbial diversity of healthy humans 

The diversity of microbes within a given body habitat can be defined as 
the number and abundance distribution of distinct types of organisms, 
which has been linked to several human diseases: low diversity in the 
gut to obesity and inflammatory bowel disease*”, for example, and high 
diversity in the vagina to bacterial vaginosis’. For this large study 


involving microbiome samples collected from healthy volunteers at 
two distinct geographic locations in the United States, we have defined 
the microbial communities at each body habitat, encountering 81-99% 
of predicted genera and saturating the range of overall community 
configurations (Fig. 1, Supplementary Fig. 1 and Supplementary 
Table 1; see also Fig. 4). Oral and stool communities were especially 
diverse in terms of community membership, expanding prior observa- 
tions’, and vaginal sites harboured particularly simple communities 
(Fig. 1a). This study established that these patterns of alpha diversity 
(within samples) differed markedly from comparisons between 
samples from the same habitat among subjects (beta diversity, 
Fig. 1b). For example, the saliva had among the highest median alpha 
diversities of operational taxonomic units (OTUs, roughly species level 
classification, see http://hmpdacc.org/HMQCP), but one of the lowest 
beta diversities—so although each individual’s saliva was ecologically 
rich, members of the population shared similar organisms. Conversely, 
the antecubital fossae (skin) had the highest beta diversity but were 
intermediate in alpha diversity. The vagina had the lowest alpha diversity, 
with quite low beta diversity at the genus level but very high among 
OTUs due to the presence of distinct Lactobacillus spp. (Fig. 1b). The 
primary patterns of variation in community structure followed the 
major body habitat groups (oral, skin, gut and vaginal), defining as a 
result the complete range of population-wide between-subject variation 
in human microbiome habitats (Fig. 1c). Within-subject variation over 
time was consistently lower than between-subject variation, both in 
organismal composition and in metabolic function (Fig. 1d). The 
uniqueness of each individual’s microbial community thus seems to 
be stable over time (relative to the population as a whole), which may be 
another feature of the human microbiome specifically associated with 
health. 

No taxa were observed to be universally present among all body 
habitats and individuals at the sequencing depth employed here, 
unlike several pathways (Fig. 2 and Supplementary Fig. 2, see below), 
although several clades demonstrated broad prevalence and relatively 
abundant carriage patterns®’. Instead, as suggested by individually 


*Lists of participants and their affiliations appear at the end of the paper. 
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Figure 1 | Diversity of the human microbiome is concordant among 
measures, unique to each individual, and strongly determined by microbial 
habitat. a, Alpha diversity within subjects by body habitat, grouped by area, as 
measured using the relative inverse Simpson index of genus-level phylotypes 
(cyan), 16S rRNA gene OTUs (blue), shotgun metagenomic reads matched to 
reference genomes (orange), functional modules (dark orange), and enzyme 
families (yellow). The mouth generally shows high within-subject diversity and 
the vagina low diversity, with other habitats intermediate; variation among 
individuals often exceeds variation among body habitats. b, Bray—Curtis beta 
diversity among subjects by body habitat, colours as for a. Skin differs most 
between subjects, with oral habitats and vaginal genera more stable. Although 
focused studies****”, each body habitat in almost every subject was 
characterized by one or a few signature taxa making up the plurality of 
the community (Fig. 3). Signature clades at the genus level formed on 
average anywhere from 17% to 84% of their respective body habitats, 
completely absent in some communities (0% at this level of detection) 
and representing the entire population (100%) in others. Notably, less 
dominant taxa were also highly personalized, both among individuals 
and body habitats; in the oral cavity, for example, most habitats are 
dominated by Streptococcus, but these are followed in abundance by 
Haemophilus in the buccal mucosa, Actinomyces in the supragingival 
plaque, and Prevotella in the immediately adjacent (but low oxygen) 
subgingival plaque’®. 

Additional taxonomic detail of the human microbiome was pro- 
vided by identifying unique marker sequences in metagenomic data" 
(Fig. 3a) to complement 16S profiling (Fig. 3b). These two profiles 
were typically in close agreement (Supplementary Fig. 3), with the 
former in some cases offering more specific information on members 
of signature genera differentially present within habitats (for example, 
vaginal Prevotella amnii and gut Prevotella copri) or among indivi- 
duals (for example, vaginal Lactobacillus spp.) One application of this 
specificity was to confirm the absence of NIAID (National Institute of 


208 | NATURE | VOL 486 | 14 JUNE 2012 


IB Phylotypes (16S) 


IB OTUs (168) 

I Reference genomes (WGS) 
i Metabolic modules (WGS) 
[Gene index (WGS) 


Gastrointestinal 


PC2 (4.4%) 


PCT (13%) 


BB Technical replicates (16S) Hl Between visits (16S) i Between visits (WGS) 
44 IB Between subjects (16S) Hl Between subjects (WGS) 
- a: 


1 I - 
34 + T tl ! 
> H i TI io I I 
eS 1 Tr 1 ‘“ 1 +1 
YQ 2, 1 | r 
on ee en 
o 14 I 1 Tt ne 1? (fa t 
© ! ! - I ae ' I Ty 
Z & t \ 1 |e! ~ 1 H es oe 
®B OF * * Lf i . "g +7! 
o 1 
be 1 | : | I ' i 
y717 1 1 ry T -f 
3 1 4 - - 
4 ! 14 | 
“2 ty iy -{ 
ay itl + 
-34 1 
Sah 1¥ 4 
\ ly 
-4 T T T T T 
Urogenital Skin Nasal Gastrointestinal Oral 


Vaginal introitus 


alpha- and beta-diversity are not directly comparable, changes in structure 
among communities (a) occupy a wider dynamic range than do changes within 
communities among individuals (b). ¢, Principal coordinates plot showing 
variation among samples demonstrates that primary clustering is by body area, 
with the oral, gastrointestinal, skin and urogenital habitats separate; the nares 
habitat bridges oral and skin habitats. d, Repeated samples from the same 
subject (blue) are more similar than microbiomes from different subjects (red). 
Technical replicates (grey) are in turn more similar; these patterns are 
consistent for all body habitats and for both phylogenetic and metabolic 
community composition. See previously described sample counts’ for all 
comparisons. 


Allergy and Infectious Diseases) class A-C pathogens above 0.1% 
abundance (aside from Staphylococcus aureus and Escherichia coli) 
from the healthy microbiome, but the near-ubiquity and broad dis- 
tribution of opportunistic ‘pathogens’ as defined by PATRIC”. 
Canonical pathogens including Vibrio cholerae, Mycobacterium 
avium, Campylobacter jejuni and Salmonella enterica were not 
detected at this level of sensitivity. Helicobacter pylori was found in 
only two stool samples, both at <0.01%, and E. coli was present at 
>0.1% abundance in 15% of stool microbiomes (>0% abundance in 
61%). Similar species-level observations were obtained for a small 
subset of stool samples with 454 pyrosequencing metagenomics data 
using PhylOTU’"™*. In total 56 of 327 PATRIC pathogens were 
detected in the healthy microbiome (at >1% prevalence of >0.1% 
abundance, Supplementary Table 2), all opportunistic and, strikingly, 
typically prevalent both among hosts and habitats. The latter is in 
contrast to many of the most abundant signature taxa, which were 
usually more habitat-specific and variable among hosts (Fig. 3a, b). 
This overall absence of particularly detrimental microbes supports the 
hypothesis that even given this cohort’s high diversity, the microbiota 
tend to occupy a range of configurations in health distinct from many 
of the disease perturbations studied to date*”’. 
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Figure 2 | Carriage of microbial taxa varies while metabolic pathways 
remain stable within a healthy population. a, b, Vertical bars represent 
microbiome samples by body habitat in the seven locations with both shotgun 
and 16S data; bars indicate relative abundances colored by microbial phyla 
from binned OTUs (a) and metabolic modules (b). Legend indicates most 
abundant phyla/pathways by average within one or more body habitats; RC, 
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retroauricular crease. A plurality of most communities’ memberships consists 
of a single dominant phylum (and often genus; see Supplementary Fig. 2), but 
this is universal neither to all body habitats nor to all individuals. Conversely, 
most metabolic pathways are evenly distributed and prevalent across both 
individuals and body habitats. 
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Figure 3 | Abundant taxa in the human microbiome that have been 
metagenomically and taxonomically well defined in the HMP population. 
a-c, Prevalence (intensity, colour denoting phylum/class) and abundance when 
present (size) of clades in the healthy microbiome. The most abundant 
metagenomically-identified species (a), 16S-identified genera (b) and 
PATRIC” pathogens (metagenomic) (c) are shown. d, e, The population size 


and sequencing depths of the HMP have well defined the microbiome at all 
assayed body sites, as assessed by saturation of added community metabolic 
configurations (rarefaction of minimum Bray—Curtis beta-diversity of 
metagenomic enzyme class abundances to nearest neighbour, inter-quartile 
range over 100 samples) (d) and phylogenetic configurations (minimum 16S 
OTU weighted UniFrac distance to nearest neighbour) (e). 
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Carriage of specific microbes 

Inter-individual variation in the microbiome proved to be specific, 
functionally relevant and personalized. One example of this is illu- 
strated by the Streptococcus spp. of the oral cavity. The genus dominates 
the oropharynx"®, with different species abundant within each sampled 
body habitat (see http://hmpdacc.org/HMSMCP) and, even at the 
species level, marked differences in carriage within each habitat among 
individuals (Fig. 4a). As the ratio of pan- to core-genomes is high in 
many human-associated microbes”, this variation in abundance could 
be due to selective pressures acting on pathways differentially present 
among Streptococcus species or strains (Fig. 4b). Indeed, we observed 
extensive strain-level genomic variation within microbial species in 
this population, enriched for host-specific structural variants around 
genomic islands (Fig. 4c). Even with respect to the single Streptococcus 
mitis strain B6, gene losses associated with these events were common, 


for example differentially eliminating S. mitis carriage of the V-type 
ATPase or choline binding proteins cbp6 and cbp12 among subsets of 
the host population (Fig. 4d). These losses were easily observable by 
comparison to reference isolate genomes, and these initial findings 
indicate that microbial strain- and host-specific gene gains and 
polymorphisms may be similarly ubiquitous. 

Other examples of functionally relevant inter-individual variation 
at the species and strain levels occurred throughout the microbiome. 
In the gut, Bacteroides fragilis has been shown to prime T-cell 
responses in animal models via the capsular polysaccharide A‘, 
and in the HMP stool samples this taxon was carried at a level of at 
least 0.1% in 16% of samples (over 1% abundance in 3%). Bacteroides 
thetaiotaomicron has been studied for its effect on host gastrointestinal 
metabolism’? and was likewise common at 46% prevalence. On the skin, 
S. aureus, of particular interest as the cause of methicillin-resistant 
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Figure 4 | Microbial carriage varies between subjects down to the species 
and strain level. Metagenomic reads from 127 tongue samples spanning 90 
subjects were processed with MetaPhlAn to determine relative abundances for 
each species. a, Relative abundances of 11 distinct Streptococcus spp. In addition 
to variation in broader clades (see Fig. 2), individual species within a single 
habitat demonstrate a wide range of compositional variation. Inset illustrates 
average tongue sample composition. b, Metabolic modules present/absent 
(grey/white) in KEGG” reference genomes of tongue streptococci denote 
selected areas of strain-specific functional differentiation. cpnt, component. 
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c, Comparative genomic coverage for the single Streptococcus mitis B6 strain. 
Grey dots are median reads per kilobase per million reads (RPKM) for 1-kb 
windows, grey bars are the 25th to 75th percentiles across all samples, red line 
the LOWESS-smoothed average. Red bars at the bottom highlight predicted 
genomic islands”. Large, discrete, and highly variable islands are commonly 
under-represented. d, Two islands are highlighted, V (V-type H* ATPase 
subunits I, K, E, C, F, A and B) and CH (choline-binding proteins cbp6 and 
cbp12), indicating functional cohesion of strain-specific gene loss within 
individual human hosts. 
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S. aureus (MRSA) infections, had 29% nasal and 4% skin carriage 
rates, roughly as expected’®. Close phylogenetic relatives such as 
Staphylococcus epidermidis (itself considered commensal) were, in 
contrast, universal on the skin and present in 93% of nares samples, 
and at the opposite extreme Pseudomonas aeruginosa (a representative 
Gram-negative skin pathogen) was completely absent from both body 
habitats (0% at this level of detection). These and the data above suggest 
that the carriage pattern of some species in the human microbiome 
may be analogous to genetic traits, where recessive alleles of modest 
risk are maintained in a population. In the case of the human micro- 
biome, high-risk pathogens remain absent, whereas species that pose a 
modest degree of risk also seem to be stably maintained in this 
ecological niche. 

Finally, microorganisms within and among body habitats exhibited 
relationships suggestive of driving physical factors such as oxygen, 
moisture and pH, host immunological factors, and microbial inter- 
actions such as mutualism or competition” (Supplementary Fig. 4). 
Both overall community similarity and microbial co-occurrence and 
co-exclusion across the human microbiome grouped the 18 body 
habitats together into four clusters corresponding to the five target 
body areas (Supplementary Fig. 4a, b). There was little distinction 
among different vaginal sites, with Lactobacillus spp. dominating all 
three and correlating in abundance. However, Lactobacillus varied 
inversely with the Actinobacteria and Bacteroidetes (see Supplemen- 
tary Fig. 4c and Figs 2 and 3), as also observed in a previous cohort’. 
Gut microbiota relationships primarily comprised inverse associa- 
tions with the Bacteroides, which ranged from dominant in some 
subjects to a minority in others who carried a greater diversity of 
Firmicutes. A similar progression was evident in the skin communities, 
dominated by one of Staphylococcus (phylum  Firmicutes), 
Propionibacterium, or Corynebacterium (both phylum Actinobacteria), 
with a continuum of oral organisms (for example, Streptococcus) appear- 
ing in nares communities (Supplementary Fig. 4c). These observations 
suggest that microbial community structure in these individuals 
may sometimes occupy discrete configurations and under other 
circumstances vary continuously, a topic addressed in more detail by 
several HMP investigations (ref. 6 and unpublished results). An 
individual’s location within such configurations is indicative of current 
microbial carriage (including pathogens) and of the community’s 
ability to resist future pathogen acquisition or dysbiosis; it may thus 
prove to be associated with disease susceptibility or other phenotypic 
characteristics. 


Microbiome metabolism and function 


As the first study to include both marker gene and metagenomic data 
across body habitats from a large human population, we additionally 
assessed the ecology of microbial metabolic and functional pathways 
in these communities. We reconstructed the relative abundances of 
pathways in community metagenomes”, which were much more 
constant and evenly diverse than were organismal abundances 
(Fig. 2b, see also Fig. 1), confirming this as an ecological property of 
the entire human microbiome’. We were likewise able to determine 
for the first time that taxonomic and functional alpha diversity across 
microbial communities significantly correlate (Spearman of inverse 
Simpson’s r= 0.60, P= 3.6 X 10°, n= 661), the latter within a 
more proscribed range of community configurations (Supplemen- 
tary Fig. 5). 

Unlike microbial taxa, several pathways were ubiquitous among 
individuals and body habitats. The most abundant of these ‘core’ 
pathways include the ribosome and translational machinery, nucleo- 
tide charging and ATP synthesis, and glycolysis, and reflect the basics 
of host-associated microbial life. Also in contrast to taxa, few path- 
ways were highly variable among subjects within any body habitat; 
exceptions included the Sec (orally, pathway relative abundance 
s.d. = 0.0052; total mean of oral standard deviations = 0.0011 with 
s.d. = 0.0016) and Tat (globally, pathway s.d.= 0.0055; mean of 
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global standard deviations = 0.0023 with s.d. = 0.0033) secretion sys- 
tems, indicating a high degree of host-microbe and microbe—-microbe 
interactions in the healthy human microbiota. This high variability 
was particularly present in the oral cavity; for phosphate, mono- and 
di-saccharide, and amino acid transport in the mucosa; and also for 
lipopolysaccharide biosynthesis and spermidine/putrescine synthesis 
and transport on the plaque and tongue (http://hmpdacc.org/ 
HMMRC). The stability and high metagenomic abundance of this 
housekeeping ‘core’ contrasts with the greater variability and lower 
abundance of niche-specific functionality in rare but consistently 
present pathways; for example, spermidine biosynthesis, methionine 
degradation and hydrogen sulphide production, all examples highly 
prevalent in gastrointestinal body sites (non-zero in >92% of 
samples) but at very low abundance (median relative abundance 
< 0.0052). This ‘long tail’ of low-abundance genes and pathways also 
probably encodes much of the uncharacterized biomolecular function 
and metabolism of these metagenomes, the expression levels of which 
remain to be explored in future metatranscriptomic studies. 

Protein families showed diversity and prevalence trends similar to 
those of full pathways, ranging from maxima of only ~ 16,000 unique 
families per community in the vagina to almost 400,000 in the oral 
cavity (Fig. la, b; http://hmpdacc.org/HMGI). A remarkable fraction 
of these families were indeed functionally uncharacterized, including 
those detected by read mapping, with a minimum in the oral cavity 
(mean 58% s.d. 6.8%) and maximum in the nares (mean 77% s.d. 
11%). Likewise, many genes annotated from assemblies could not 
be assigned a metabolic function, with a minimum in the vagina 
(mean 78% s.d. 3.4%) and maximum in the gut (mean 86% s.d. 
0.9%). The latter range did not differ substantially by body habitat 
and is in close agreement with previous comprehensive gene catalogues 
of the gut metagenome’. Taken together with the microbial variation 
observed above throughout the human microbiome, functional vari- 
ation among individuals might indicate pathways of particular import- 
ance in maintaining community structure in the face of personalized 
immune, environmental or dietary exposures among these subjects. 
Determining the functions of uncharacterized core and variable protein 
families will be especially essential in understanding role of the micro- 
biota in health and disease. 


Correlations with host phenotype 


We finally examined relationships associating both clades and 
metabolism in the microbiota with host properties such as age, 
gender, body mass index (BMI), and other available clinical metadata 
(Fig. 5 and Supplementary Table 3). Using a sparse multivariate 
model, 960 microbial, enzymatic or pathway abundances were sig- 
nificantly associated with one or more of 15 subject phenotype and 
sample metadata features. A wide variety of taxa, gene families and 
metabolic pathways were differentially distributed with subject 
ethnicity at every body habitat (Fig. 5a), representing the phenotype 
with the greatest number (266 at false discovery rate (FDR) q < 0.2) of 
total associations with the microbiome. Vaginal pH has also been 
observed to correlate with microbiome composition’, and we detected 
in this population both the expected reduction in Lactobacillus at high 
pH and a corresponding increase in metabolic diversity (Fig. 5b). 
Intriguingly, and not previously observed, subject age was most asso- 
ciated with a collection of highly differential metagenomically 
encoded pathways on the skin (Fig. 5c), as well as shifts in skin clades 
including retroauricular Firmicutes (P = 1.0 X 10° *, q = 0.033). The 
examples of associations with ethnicity and vaginal pH are among the 
strongest associations with the microbiome, however, and most cor- 
relates (for example, with subject BMI, Fig. 5d) are more representa- 
tively modest. This lower degree of correlation held for most available 
biometrics (gender, temperature, blood pressure, etc.), with even the 
most significant associations possessing generally low effect sizes and 
considerable unexplained variance. We conclude that most variation 
in the human microbiome is not well explained by these phenotypic 
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Figure 5 | Microbial community membership and function correlates with 
host phenotype and sample metadata. a-d, The pathway and clade 
abundances most significantly associated (all FDR q < 0.2) using a multivariate 
linear model with subject race or ethnicity (a), vaginal posterior fornix pH 
(b), subject age (c) and BMI (d). Scatter plots of samples are shown with lines 


metadata, and other potentially important factors such as short- and 
long-term diet, daily cycles, founder effects such as mode of delivery, 
and host genetics should be considered in future analyses. 


Conclusions 

This extensive sampling of the human microbiome across many sub- 
jects and body habitats provides an initial characterization of the 
normal microbiota of healthy adults in a Western population. The 
large sample size and consistent sampling of many sites from the same 
individuals allows for the first time an understanding of the relationships 
among microbes, and between the microbiome and clinical parameters, 
that underpin the basis for individual variation—variation that may 
ultimately be critical for understanding microbiome-based disorders. 
Clinical studies of the microbiome will be able to leverage the resulting 
extensive catalogues of taxa, pathways and genes’, although they must 
also still include carefully matched internal controls. The uniqueness of 
each individual’s microbiome even in this reference population argues 
for future studies to consider prospective within-subjects designs where 
possible. The HMP’s unique combination of organismal and functional 
data across body habitats, encompassing both 16S and metagenomic 
profiling, together with detailed characterization of each subject, has 
allowed us and subsequent studies to move beyond the observation of 
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BMI 


indicating best simple linear fit. Race/ethnicity and vaginal pH are particularly 
strong associations; age and BMI are more representative of typically modest 
phenotypic associations (Supplementary Table 3), suggesting that variation in 
the healthy microbiota may correspond to other host or environmental factors. 


variability in the human microbiome to ask how and why these microbial 
communities vary so extensively. 

Many details remain for further work to fill in, building on this 
reference study. How do early colonization and lifelong change vary 
among body habitats? Do epidemiological patterns of transmission of 
beneficial or harmless microbes mirror patterns of transmission of 
pathogens? Which co-occurrences among microbes reflect shared 
response to the environment, as opposed to competitive or mutualistic 
interactions? How large a role does host immunity or genetics play in 
shaping patterns of diversity, and how do the patterns observed in this 
North American population compare to those around the world? Future 
studies building on the gene and organism catalogues established by the 
Human Microbiome Project, including increasingly detailed investi- 
gations of metatranscriptomes and metaproteomes, will help to unravel 
these open questions and allow us to more fully understand the links 
between the human microbiome, health and disease. 


METHODS SUMMARY 


Microbiome samples were collected from up to 18 body sites at one or two time 
points from 242 individuals clinically screened for absence of disease (K. Aagaard 
et al., manuscript submitted). Samples were subjected to 16S ribosomal RNA gene 
pyrosequencing (454 Life Sciences), and a subset were shotgun-sequenced for 
metagenomics using the Illumina GAIIx platform’. 16S data processing and 
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diversity estimates were performed using QIIME”, and metagenomic data were 
taxonomically profiled using MetaPhlAn"’, metabolically profiled by HUMAnN”, 
and assembled for gene annotation and clustering into a unique catalogue’. 
Potential pathogens were identified using the PATRIC database”, isolate reference 
genome annotations drawn from KEGG”, and reference genome mapping per- 
formed by BWA” to a reduced set of genomes to which short reads could be 
matched”*. Microbial associations were assessed by similarity measures accounting 
for compositionality~', and phenotypic association testing was performed in R. All 
data and additional protocol details are available at http://hmpdacc.org. Full 
methods accompany this paper in the Supplementary Information. 
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A variety of microbial communities and their genes (the microbiome) exist throughout the human body, with 
fundamental roles in human health and disease. The National Institutes of Health (NIH)-funded Human Microbiome 
Project Consortium has established a population-scale framework to develop metagenomic protocols, resulting in a 
broad range of quality-controlled resources and data including standardized methods for creating, processing and 
interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we 
present resources from a population of 242 healthy adults sampled at 15 or 18 body sites up to three times, which 
have generated 5,177 microbial taxonomic profiles from 16S ribosomal RNA genes and over 3.5 terabases of 
metagenomic sequence so far. In parallel, approximately 800 reference strains isolated from the human body have 
been sequenced. Collectively, these data represent the largest resource describing the abundance and variety of the 
human microbiome, while providing a framework for current and future studies. 


Advances in sequencing technologies coupled with new bioinformatic 
developments have allowed the scientific community to begin to invest- 
igate the microbes that inhabit our oceans, soils, the human body and 
elsewhere’. Microbes associated with the human body include eukaryotes, 
archaea, bacteria and viruses, with bacteria alone estimated to outnumber 
human cells within an individual by an order of magnitude. Our 
knowledge of these communities and their gene content, referred to 
collectively as the human microbiome, has until now been limited by a 
lack of population-scale data detailing their composition and function. 
The US NIH-funded Human Microbiome Project Consortium 
(HMP) brought together a broad collection of scientific experts to 
explore these microbial communities and their relationships with their 
human hosts. As such, the HMP’ has focused on producing reference 
genomes (viral, bacterial and eukaryotic), which provide a critical 
framework for subsequent metagenomic annotation and analysis, and 
on generating a baseline of microbial community structure and func- 
tion from an adult cohort defined by a carefully delineated set of clinical 
inclusion and exclusion criteria that we term ‘healthy’ in this study 
(http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/GetPdf.cgi?id= 
phd002854.2). Investigations of the microbiome from this cohort 
incorporated several complementary analyses including: 16S ribosomal 
RNA (rRNA) gene sequence (16S) and taxonomic profiles, whole- 
genome shotgun (WGS) or metagenomic sequencing of whole com- 
munity DNA, and alignment of the assembled sequences to the reference 
microbial genomes from the human body**. Thus, the HMP comple- 
ments other large-scale sequence-based human microbiome projects 
such as the MetaHIT project°, which focused on examination of the 
gut microbiome using WGS data including samples from cohorts exhib- 
iting a wide range of health statuses and physiological characteristics. 
Additional projects supported by the HMP are investigating the 
association of specific components and dynamics of the microbiome 
with a variety of disease conditions, developing tools and technology 
including isolating and sequencing uncultured organisms, and study- 
ing the ethical, legal and social implications of human microbiome 
research (http://commonfund.nih.gov/hmp/fundedresearch.aspx). A 
comprehensive list of current publications from HMP projects is 
available at http://commonfund.nih.gov/hmp/publications.aspx. 


Here we detail the resources created so far by the HMP initiative 
including: clinical specimens (samples), reference genomes, sequen- 
cing and annotation protocols, methods and analyses. We describe 
the thousands of samples obtained from 15 or 18 distinct body sites 
from 242 donors over multiple time points that were processed at two 
clinical centres (Baylor College of Medicine (BCM) and Washington 
University School of Medicine). We also describe the laboratory and 
computational protocols developed for reliably generating and inter- 
preting the human microbiome data. HMP resources include both 
protocols for, and the subsequent data generated from, 16S and meta- 
genomic sequencing of human microbiome samples. During this 
study, these protocols were rigorously standardized and quality con- 
trolled for simultaneous use across four sequencing centres (BCM 
Human Genome Sequencing Center, The Broad Institute of 
Massachusetts Institute of Technology (MIT) and Harvard, the 
J. Craig Venter Institute and The Genome Institute at Washington 
University School of Medicine). In particular, we focus on the pro- 
duction of the first phase of metagenomic data sets (phase I) used for 
subsequent in-depth analyses, and we summarize standards and 
recommendations based on our experiences generating and analysing 
these data. An additional set of publications (many included in the 
references and in those of ref. 4) describe in further detail the micro- 
bial ecology and microbiological implications of these data. 
Collectively these resources and analyses represent an important 
framework for human microbiome research. 


HMP resource organization 

Supplementary Fig. 1 summarizes organization of the HMP, including 
the data processing and analytical steps, and the scientific entities 
gathered to conduct the project. An overview of available HMP data 
sets and additional resources are provided in Supplementary Tables 
1-3. Donors were recruited and enrolled into the HMP through the 
two clinical centres. Over 240 adults were carefully screened and phe- 
notyped before sampling one to three times at 15 (male) or 18 (female) 
body sites using a common sampling protocol (http://www.ncbi.nlm. 
nih.gov/projects/gap/cgi-bin/GetPdf.cgi?id=phd003190.2). All included 
subjects were between the ages of 18 and 40 years and had passed a 


*Lists of participants and their affiliations appear at the end of the paper. 
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screening for systemic health based on oral, cutaneous and body mass 
exclusion criteria (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/ 
GetPdf.cgi?id=phd002854.2) (K. Aagaard et al., manuscript submitted). 

A Data Analysis and Coordination Center (DACC) was created to 
serve as the central repository for all HMP WGS, 16S and reference 
genome sequence information generated by the four sequencing 
centres. The DACC supports access to analysis software, biological 
samples, clinical protocols, news, publication announcements and pro- 
ject statistics, and performed centralized analysis of HMP reference 
genome and WGS annotation in cooperation with the sequencing 
centres. All unprocessed 16S, WGS and reference genome sequence 
data are deposited at the National Center for Biotechnology Informa- 
tion (NCBI) (http://www.ncbi.nlm.nih.gov/bioproject/43021). Unless 
otherwise noted, all data sets and protocols described here are available 
to the scientific community at the DACC (http://hmpdacc.org). Specific 
data sets referred to in this work and available at the DACC are indi- 
cated in parentheses with the preface ‘RES’. 


Phase I 16S and WGS sequencing overview 


A set of 5,298 samples were collected from 242 adults (K. Aagaard 
et al., manuscript submitted; Table 1 and Supplementary Table 4), 
from which 16S and WGS data were generated for a total of 5,177 
taxonomically characterized communities (16S) and 681 WGS 
samples describing the microbial communities from habitats within 
the human airways, skin, oral cavity, gut and vagina. For a subset of 
560 samples, both data types were generated (Table 1). These efforts 
constitute our initial primary metagenomic data sets (phase I) 
described in more detail later. Additional efforts are ongoing to 
sequence and analyse the remaining samples from the complete 
HMP collection (11,174 primary specimens in total from 300 indivi- 
duals sampled up to three times over 22 months) (K. Aagaard et al., 
manuscript submitted). 


16S standards development and sequencing 

The goals of the HMP required that 16S sequences and profiles from 
data produced at the four participating sequencing centres be com- 
parable in a variety of downstream analyses; however, no suitable 
methodology was available at the commencement of the project. 
While establishing 16S protocols, we determined that many compo- 
nents of data production and processing can contribute errors and 
artefacts. We investigated methods that avoid these errors and their 
subsequent effects on taxonomic classification and operational 


Table 1 | HMP donor samples examined by 16S and WGS 


taxonomic unit (OTU)-based community structure. The results are 
discussed in detail in Supplementary Information and ref. 6. Thus, 
multiple evaluations of 16S protocols were undertaken before adopt- 
ing a single standardized protocol that ensured consistency in the 
high-throughput production. 

To maximize accuracy and consistency, protocols were evaluated 
primarily using a synthetic mock community of 21 known organisms® 
(Supplementary Table 5). Additional testing of the protocol was 
carried out on a subset of HMP samples (Supplementary Table 1). 
Collectively, these efforts resulted in adoption of a protocol to amplify 
and sequence samples using the Roche-454 FLX Titanium platform® 
(http://www.hmpdacc.org/doc/HMP_MDG_454_16S_Protocol.pdf). 
The HMP created both cell mixtures and genomic DNA extracts of the 
mock community (Supplementary Tables 2 and 5). A large body of 
metagenomic data (both 16S and WGS) (RES: HMMC) from these 
and other calibration experiments are available to the community to 
facilitate further benchmarking of new molecular and analytical 
approaches (Supplementary Table 3). 

The majority of the sample collection was targeted for 16S sequen- 
cing using the 454 FLX Titanium based strategy*®. The nucleotide 
sequence of the 16S rRNA gene consists of regions of highly conserved 
sequence, which alternate with nine regions or windows of variable 
nucleotide sequence that constitute the most informative portions of 
the gene sequence for use in taxonomic classification. A window 
covering number three (V3) to five (V5) variable regions (V35) of 
the 16S rRNA gene was chosen as the target for 4,879 samples. 
Sequence of a V1 to V3 (V13) window was also included for a subset 
of 2,971 samples to provide a complementary view of taxonomic 
profiles® (RES:HMR16S) (Table 1, Supplementary Figs 2, 3 and 
Supplementary Information). 

After adoption of the 16S protocol, including removal of multiple 
sources of potential artefacts or bias generated by 16S sequencing 
using pyrosequencing””, a variety of approaches for accurate diversity 
estimation were developed and compared’. A 16S data processing 
pipeline was established using the mothur software package’® 
(Supplementary Information), which includes two optional low and 
high stringency approaches. The former provides an output favouring 
longer read lengths tailored towards taxonomic classification, the latter 
an output with more aggressive sequence error reduction tailored 
towards OTU construction (RES: HMMCP). A third complementary 
pipeline was also developed using the QIIME software package’ 
(Supplementary Information), which processes these data using an 


Body region Body site Total Totall6S 13 V13 read V35 V35 read Samples  TotalWGS Totalread Filtered Human Remaining Samples 

samples samples samples depth (M)* samples depth (M)* V13andV35 samples depth(G)+ reads reads readdepth 16S and 
(Seyi (%)8 (Gt WGs 
Gut Stool 352 337 93 14 328 24 184 136 1,720.7 15 1 1,450.6 124 
Oral cavity Buccal mucosa 346 330 84 i3 314 1.7 168 107 1,438.0 9 82 136.7 91 
Hard palate 325 325 79 12 310 17 164 1 10.9 20 25 5a 1 
Keratinized gingiva 335 329 83 i3 319 17 173 6 72.3 5 47 34.4 0 
Palatine tonsils 337 332 89 1.2 315 19 172 6 74.8 2 80 35 1 
Saliva 315 310 66 0.9 292 15 148 5 55,7 1 91 4.2 0 
Subgingival plaque 334 328 86 2 314 18 172 7 92.1 5 79 53 1 
Supragingival plaque 345 331 92 12 316 19 177 115 1,500.7 is 40 674.8 101 
Throat 331 325 76 1.0 312 1.7 163 y 78.8 4 to 3.6 1 
Tongue dorsum 348 332 93 i3 320 2.0 181 122 1,620.1 15 19 1,084.3 106 
Airway Anterior nares 316 302 69 1.0 283 2 150 84 1129.9. 3 96 43 70 
Skin Left antecubital fossa 269 269 58 0.7 221 0.5 110 0) NA NA NA 0 NA 
Left retroauricularcrease 313 312 88 1.6 295 15 171 9 126.3 9 73 22:1 8 
Right antecubital fossa 274 274 58 0.7 229 0.5 113 6) NA NA NA 0 NA 
Right retroauricularcrease 319 316 90 14 304 1.6 178 15 181.9 18 59 42.4 12 
Vagina Mid-vagina 145 143 91 0.6 140 1.0 88 2 22.6 ) 99 0.2 0 
Posterior fornix 152 142 89 0.6 136 1.0 83 53 702.1 6 90 25.2 43 
Vaginal introitus 142 140 87 0.6 131 0.9 78 3 36.5 1 98 0.6 ut 
Total 5298 5,177 2,971 19 4,879 26.3 2673 681 8,863.3 il 49 3,538.1 560 


NA, not applicable. 

*1x10® reads post-processing with the mothur pipeline (Supplementary Information). 
+1x 10° reads (Supplementary Information). 

¢ Fraction of reads with low quality bases that were removed (Supplementary Information). 
§ Fraction of human reads that were removed (Supplementary Information). 
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OTU-binning strategy to which taxonomic classification is added 
(RES:HMQCP). All pipelines result in highly comparable views of 
the human microbiome. 


Metagenomic assembly and gene cataloguing 
Approximately 749 samples representing targeted body sites were 
chosen for WGS sequencing using the Illumina GAIIx platform with 
101-base-pair paired-end reads. From a high-quality set of 681 samples 
an average depth of 13 Gb (+ 4.3) was achieved per sample, collectively 
producing a total of 8.8 Tb (RES: HMIWGS) (Table 1). Theoretically, 
these per sample data are sufficient to cover a 3 Mb bacterial genome 
present at only 0.8% abundance with a probability of 90% (M. C. 
Wendl et al., manuscript submitted). In addition, 12 stool samples 
were simultaneously sequenced using the 454 FLX Titanium platform 
(RES: HM4WGS). Comparisons between the centres demonstrated 
high consistency of target sequencing depth and success rates*. After 
development of a protocol for removing reads resulting from human 
DNA contamination (Supplementary Information), 49% of the reads 
were targeted for removal as human (for information on authorized 
access to these reads, see Supplementary Information). Samples 
collected from soft tissue tended to have higher human contamination 
(for example, mid-vagina (96%), anterior nares (82%) and throat 
(75%)). Preparations from saliva were also high in human DNA 
sequence (80%), whereas stool contained a relatively low abundance 
of human reads (up to 1%) (Supplementary Fig. 4). 

After application of a quality control protocol that includes human 
sequence removal, quality filtering and trimming of reads (Sup- 
plementary Information), the remaining 3.5Tb from 681 samples 
were subjected to a three-tiered complementary analysis strategy 
(Supplementary Information) of reference genome mapping (which 
was able to use ~57% of the data), assembly and gene prediction 
(~50% of the data), and metabolic reconstruction (~36% of the 
data). This combined strategy facilitated the extraction of maximal 
organismal and functional information. 

Metagenomic assemblies were generated for all available samples 
using an optimized SOAPdenovo protocol with parameters designed 
to produce substrates for downstream analyses such as gene and 
function prediction, resulting in a total of 41 million contigs 
(RES:HMASM) (Supplementary Information). Reads that remained 
unassembled were pooled across individual body sites and re- 
assembled using the same approach, resulting in an additional 
4,200,672 contigs (RES: HMBSA). These body-site-specific assemblies 
are aimed at reconstructing organisms that represent too small a 
fraction in any individual sample to assemble but are found among 
many individuals. For 12 stool samples both Illumina and 454 FLX 
Titanium data (RES:HM4WGS) were generated, allowing a hybrid 
assembly approach using Newbler (Supplementary Information) 
(RES: HMHASM). Overall, the assembly statistics recovered varied 
substantially depending on body site and community complexity 
(Supplementary Fig. 5). However, our results indicate that, for the 
assembly strategy we used, metagenomic assembly quality plateaus at 
approximately 6Gb of microbial sequence coverage for a sample 
possessing a microbial community structure similar to that of stool 
samples (Supplementary Fig. 6). 

A WGsS-based perspective of community membership was obtained 
by aligning the reads to a set of 1,742 finished bacterial, 131 archaeal, 
3,683 viral and 326 microeukaryotic reference genomes’ 
(RES:HMREFG) (Supplementary Information) representing a broad 
taxonomic range from each of these four domains. A total of 57.6% of 
the high-quality microbial reads could be associated with a known 
genome (ranging from 33-77% for anterior nares and posterior fornix, 
respectively) (RES: HMSCP). The overwhelming majority of mapped 
sequences originated from bacteria (99.7%), while the remaining reads 
mapped to microeukaryotes (0.3%) or archaea (<0.01%) (Supplemen- 
tary Information). 
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Two complementary approaches were used to summarize overall 
function and metabolism of the human microbiome, producing two 
primary data sets of annotations (RES: HMMRC and RES:HMGI) 
(Supplementary Information) and additional secondary analyses 
(RES:HMGS, HMHGI, HMGC and HMGOI) (Supplementary 
Information) available to the community for further interroga- 
tion. The first primary data set of annotations was produced by 
mapping individual shotgun reads to characterized protein families’’ 
(RES:HMMRC). The second was produced from functionally 
annotated gene predictions generated from the metagenomic 
assemblies (RES: HMGI), which were subsequently grouped accord- 
ing to high-level biological processes and to selected additional 
processes specific to metabolism and regulation’* (RES:HMGS) 
(Supplementary Tables 6, 7 and Supplementary Fig. 7). 


HMP data generation and analysis lessons 


A key manner in which the HMP resources will serve to guide future 
studies of the microbiome is by enabling informed decisions regard- 
ing sampling protocols and genomic DNA preparation (K. Aagaard et 
al., manuscript submitted), sequencing depth (M. C. Wendl et al., 
manuscript submitted), statistical power (P. S. La Rosa et al., manu- 
script submitted) and metagenomic data type. As indicated in Table 1, 
the consortium successfully amplified 16S sequences to our target 
depth at all 18 body sites, with the fewest sequences recovered con- 
sistently from the antecubital fossae. The amount of host human 
DNA recovered and the finest level of OTU resolution varied for 
16S sequences among body sites° (Supplementary Figs 3 and 4). 

From our WGS investigations, a series of protocols (http://hmpdacc. 
org/tools_protocols/tools_protocols.php) have been established to 
process large volumes of short-read WGS data and to annotate and 
examine these data through both a multi-tiered assembly approach 
and as single reads'*. An investigator's choice of metagenomic 
technologies can thus be guided not only by a 16S versus WGS 
dichotomy, but also by the expected fraction of host sequence and 
the appropriate 16S region targeting the dominant taxa at each body 
site (Supplementary Figs 2-6 and 8). 

Together, these data sets represent comprehensive and comple- 
mentary views of the human microbiome, as shown by comparing 
organismal (Fig. 1a) and gene (Fig. 1b) catalogues, and the ratio of 
genes contributed per OTU (Fig. 1c). The discovery rate of new gene 
clusters (as determined by annotation of assembled WGS data) is in 
general detected more slowly relative to organismal discovery (as 
determined by OTU data) owing to the fragmentary nature of these 
community reads and assemblies despite high sequence depth 
(Fig. la, b and Supplementary Fig. 9), and the number of genes con- 
tributed per OTU varies by body site (Fig. 1c and Supplementary 
Information). However, in general, these results highlight an import- 
ant point for consideration of further microbiome investigations 
using these data sets, as they suggest that the majority of the common 
taxa and genes present in this reference population have been 
detected. 

We additionally compared the gut community gene catalogue 
sampled by the HMP with that of MetaHIT in terms of total detected 
gene counts. The HMP recovered more total non-redundant gene 
counts (5,140,472) than reported by MetaHIT (3,299,822)°, probably 
reflecting a combination of the increased sequence depth obtained by 
the HMP (11.7 Gb HMP, 4.5 Gb MetaHIT on average) and differences 
in data generation and processing”. 

The two non-redundant sets of gene sequences were subsequently 
combined and compared by matches to a database of orthologous 
groups’® of functionally annotated genes. Approximately 57% of the 
orthologous groups recovered by this method overlapped between the 
data sets, while an additional 34% versus 10% were unique to the 
HMP and MetaHIT, respectively (Supplementary Fig. 10, Supplemen- 
tary Table 8 and Supplementary Information). After removal of genes 
that received any orthologous group assignment, the remaining novel 
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Figure 1 | Rates of gene and OTU discovery from HMP taxonomic and 
metagenomic data. a-c, Accumulation curves for OTU counts from 16S data 
(all body sites) (a), clustered gene index counts from metagenomic data (all 
applicable body sites) (b) and the ratio of average unique genes contributed 
versus unique OTUs encountered with increasing sample counts 

(c) (Supplementary Information). L, left; R, right. Ratios given for each curve in 


genes were subsequently clustered’’. Approximately 79% of the HMP- 
derived novel gene clusters were orthologous to one or more clusters in 
MetaHIT, while an additional 16% were unique to this study versus 5% 
for MetaHIT-derived data’ (Supplementary Fig. 11, Supplementary 
Table 8 and Supplementary Information). These results suggest that, 
for this body habitat, relatively similar gene catalogues were recovered 
despite differences in experimental design and protocols. However, a 
greater proportion of both annotated and unique novel genes were 
detected in the HMP data set, emphasizing the utility of sequencing 
depth in recovering gene function and, in particular, deriving rare 
function. These results further underscore the importance of large- 
scale sequence-based studies of the microbiome to characterize better 
its gene content and diversity. 


Human microbiome reference genomes 
The current goal for the reference genome component of the HMP is to 
sequence at least 3,000 reference bacterial genomes, and additional viral 
and microeukaryotic genomes, associated with the human body. Thus 
far, more than 800 genomes have been sequenced and are available from 
the NCBI and the DACC (http://hmpdacc.org/HMRGD). From an 
alignment of WGS reads to reference genomes (RES:HMREFG), 
approximately 26% from the total read set (46% of all reads that could 
be aligned) were matched to a subset of 223 HMP reference genomes 
(Supplementary Information and Supplementary Data). 

We continue to solicit community feedback for strains that will 
best benefit our attempts at understanding the breadth of human 
microbiome diversity. For example, a prioritized list of the 
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c represent the average number of unique genes contributed per unique OTU at 
the final sample count. Curves for stool, buccal mucosa and anterior nares 
suggest that the proportion of gene-to-taxa discovery has stabilized. In contrast, 
the curve for supragingival plaque suggests that relatively fewer new genes are 
being contributed per additional OTU. Error bars represent 95% confidence 
intervals. 


‘most wanted’ HMP taxa is being maintained (http://hmpdacc.org/ 
most_wanted/) with the goal of targeting these difficult to obtain 
organisms using both culture-based and single-cell approaches. 

A catalogue of all HMP reference genomes along with custom 
filtering, viewing, graphing and download options can be found at 
the DACC Project Catalogue (http://www.hmpdacc-resources.org/ 
hmp_catalog/main.cgi). In addition, comparative analyses of reference 
genomes are provided by the data warehouse and analytical systems, 
Integrated Microbial Genomes/HMP (http://www.hmpdacc-resources. 
org/cgi-bin/imgm_hmp/main.cgi). Cultures of all HMP reference strains 
are required to be made publicly available through the Biodefense and 
Emerging Infections Research Resources Repository (BEI). Information 
on strain acquisition can be found at the DACC (http://hmpdacc.org/ 
reference_genomes/reference_genomes.php) and BEI (http://www. 
beiresources.org/tabid/1901/stabid/1901/CollectionLinkID/4/Default. 
aspx). 


Conclusion 


An overarching goal of this multi-year, multi-centre project is the 
generation of a community resource to advance research efforts 
related to the microbiome. The result is a collection of 11,174 primary 
biological specimens representing the human microbiome, as well as 
corresponding blood samples from the human donors, which are 
being reserved for sequencing at a future date and from which cell 
lines will be developed. A variety of new protocols were developed 
to enable a project of this scope; these include methods for donor 
recruitment, laboratory and sequence processing, and analysis of 
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16S and WGS sequence and profiles. These resources serve as models 
to guide the design of similar projects. Studies with a primary focus on 
disease can use this reference for comparative purposes, including 
detecting shifts in microbial taxonomic and functional profiles, or 
identification of new species not present in healthy cohorts that 
appear under disease conditions. The catalogue described in this study 
is, to our knowledge, the largest and most comprehensive reference set 
of human microbiome data associated with healthy adult individuals. 
Collectively the data represent a treasure trove that can be mined to 
identify new organisms, gene functions, and metabolic and regulatory 
networks, as well as correlations between microbial community struc- 
ture and health and disease*. Among other future benefits, this resource 
may promote the development of novel prophylactic strategies such as 
the application of prebiotics and probiotics to foster human health. 


METHODS SUMMARY 


As part of a multi-institutional collaboration, the HMP human subjects study was 
reviewed by the Institutional Review Boards (IRBs) at each sampling site: the BCM 
(IRB protocols H-22895 (IRB no. 00001021) and H-22035 (IRB no. 00002649)); 
Washington University School of Medicine (IRB protocol HMP-07-001 (IRB no. 
201105198)); and St Louis University (IRB no. 15778). The study was also reviewed 
by the J. Craig Venter Institute under IRB protocol 2008-084 (IRB no. 00003721), 
and at the Broad Institute of MIT and Harvard the study was determined to be 
exempt from IRB review. All study participants gave their written informed con- 
sent before sampling and the study was conducted using the Human Microbiome 
Project Core Sampling Protocol A. Each IRB has a federal-wide assurance and 
follows the regulations established in 45 CFR Part 46. The study was conducted in 
accordance with the ethical principles expressed in the Declaration of Helsinki and 
the requirements of applicable federal regulations. 
All further details are in Supplementary Information. 
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Gut microbial communities represent one source of human genetic and metabolic diversity. To examine how gut 
microbiomes differ among human populations, here we characterize bacterial species in fecal samples from 531 
individuals, plus the gene content of 110 of them. The cohort encompassed healthy children and adults from the 
Amazonas of Venezuela, rural Malawi and US metropolitan areas and included mono- and dizygotic twins. Shared 
features of the functional maturation of the gut microbiome were identified during the first three years of life in all 
three populations, including age-associated changes in the genes involved in vitamin biosynthesis and metabolism. 
Pronounced differences in bacterial assemblages and functional gene repertoires were noted between US residents and 
those in the other two countries. These distinctive features are evident in early infancy as well as adulthood. Our findings 
underscore the need to consider the microbiome when evaluating human development, nutritional needs, physiological 


variations and the impact of westernization. 


Genetic variation between human populations is typically viewed as 
differences in the allele frequencies of shared Homo sapiens genes. 
Another source of genetic and metabolic diversity resides in differ- 
ences in the representation of the millions of genes and myriad gene 
functions within our gut microbial communities’ *. Sampling a broad 
population of healthy humans representing different ages and cultural 
traditions offers an opportunity to discover how our gut microbiomes 
evolve within a lifespan, vary between populations, and respond to 
our changing lifestyles'*°. Therefore, we conducted a demonstration 
project to address the question of whether there are discernible patterns 
of functional maturation of the gut communities of healthy infants and 
children living in geographically and culturally distinct settings. 

Fecal samples were obtained from individuals in families of Guahibo 
Amerindians residing in two villages (Platanillal and Coromoto), sepa- 
rated by 10 miles, and located near Puerto Ayacucho in the Amazonas 
State of Venezuela (see Supplementary Table la, b for information 
about their diets). Fecal samples were also procured from members 
of families living in four rural communities of Malawi located within 
10-70 miles of one another (Chamba, Makwhira, Mayaka and Mbiza). 
Lifestyles in these villages are very similar, and diets are relatively 
monotonous, dominated by maize (Supplementary Table 1c). In addi- 
tion, we sampled families distributed across the United States, includ- 
ing the greater metropolitan areas of St Louis, Philadelphia and 
Boulder. The sampled populations included parents and siblings, 
and, in the United States and Malawi, monozygotic and dizygotic twin 
pairs. A total of 531 individuals (151 families) were studied: 115 indi- 
viduals (34 families) from Malawi; 100 individuals (19 families) from 
Venezuela; and 316 individuals (98 families) from the United States 
(see Supplementary Table 2 for subject characteristics; note that all 


except 35 adults and one child from the United States were explicitly 
recruited for this study). 

DNA was extracted from a single fecal sample donated by each 
person. Variable region 4 (V4) of bacterial 16S ribosomal RNA genes 
present in each fecal community was amplified by PCR, and the 
resulting amplicons were sequenced on an Illumina HiSeq 2000 
instrument (n = 1,803,250 + 562,877 (mean + s.d.) reads per fecal 
sample; 1,093,740,274 total reads; Supplementary Table 2a) to define 
the phylogenetic types (phylotypes) present. Species-level bacterial 
phylotypes were defined as organisms sharing =97% nucleotide 
sequence identity in the V4 regions of their 16S rRNA genes'®. In 
addition, we characterized functions encoded in community DNA 
by performing multiplex shotgun 454 pyrosequencing of fecal DNA 
from a subset of 110 fecal samples, encompassing 43 families with 
members matched as closely as possible for age (155,890 + 87,083 
reads per sample; total size of data set, 5.9 Gb; Supplementary Table 
2b). The resulting shotgun reads were annotated with Kyoto 
Encyclopedia of Genes and Genomes (KEGG) Orthology group 
(KO) assignments and with Enzyme Commision (EC) numbers 
(KEGG version 58). 


Taxonomic changes as a function of age and population 

Many reports have examined the bacterial species content of the 
gastrointestinal tracts of infants and children within one population 
using culture-based methods. Far fewer studies have attempted to 
compare the gut communities of humans living in markedly different 
socio-economic, geographic and cultural settings'’'’. Culture- 
independent techniques have been used to define the gut microbiota 
at various points in postnatal development*”’, but have been limited 
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by the analytic methods used, by the low number of subjects examined, 
or by the scope of the populations surveyed. These studies have 
nonetheless provided important insights. Using 16S rRNA gene-based 
microarrays'’, a recent study found considerable intra- and inter- 
personal variation in fecal bacterial community structures during the 
first year of life in 12 unrelated children and 1 twin pair. Interpersonal 
variation was less within the twin pair, and intrapersonal variation 
decreased as a function of age. A quantitative PCR study of five 
bacterial taxa in the fecal microbiota of 1,032 Dutch infants at 1 month 
of age'’ documented differences based on birth mode (Caesarian 
versus vaginal; also see ref. 8). 

We collected bacterial V4 16S rRNA data from 326 individuals aged 
0-17 years (83 Malawian, 65 Amerindian and 178 US residents), plus 
202 adults aged 18-70 years (31 Malawians, 35 Amerindians and 136 
US residents). The 16S rRNA data sets were first analysed using 
UniFrac, an algorithm that measures similarity among microbial 
communities based on the degree to which their component taxa 
share branch length on a bacterial tree of life’®. There were several 
notable findings. First, the phylogenetic composition of the bacterial 
communities evolved towards an adult-like configuration within the 
three-year period after birth in all three populations (Fig. 1a and 
Supplementary Fig. 1). Second, interpersonal variation was signifi- 
cantly greater among children than among adults; this finding was 
robust to geography (Fig. 1b; see also ref. 4). Third, there were 
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Figure 1 | Differences in the fecal microbial communities of Malawians, 
Amerindians and US children and adults. a, UniFrac distances between 
children and adults decrease with increasing age of children in each population. 
Each point shows the average distance between a child and all adults unrelated 
to that child but from the same country. Results are derived from bacterial V4 
16S rRNA data sets. b, Large interpersonal variations are observed in the 
phylogenetic configurations of fecal microbial communities at early ages. 
Malawian and Amerindian (Amr) children and adults are more similar to one 
another than to US children and adults. UniFrac distances were defined from 
bacterial V4 16S rRNA data generated from the microbiota of 181 unrelated 
adults (=18 years old) and 204 unrelated children (n = 31 Malawians 0.03-3 
years old, 21 3-17 years old; 30 Amerindians 0.08-3 years old, 29 3-17 years 
old; 31 US residents 0.08-3 years old, 62 sampled at 3-17 years of age). 

*P< 0.05, **P < 0.005 (Student’s t-test with 1,000 Monte Carlo simulations). 
See Supplementary Table 3 for a complete description of the statistical 
significance of all comparisons shown. c, PCoA of unweighted UniFrac 
distances for the fecal microbiota of adults. PC, principal coordinate. 
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significant differences in the phylogenetic composition of fecal micro- 
biota between individuals living in the different countries, with espe- 
cially pronounced separation occurring between the US and the 
Malawian and Amerindian gut communities; this was true for indi- 
viduals aged 0-3 years, 3-17 years, and for adults (Fig. 1b and 
Supplementary Table 3). Unsupervised clustering using principal 
coordinates analysis (PCoA) of UniFrac distance matrices indicated 
that age and geography/cultural traditions primarily explain the vari- 
ation in our data set, in which US microbiota clustered separately 
from non-US microbiota along principal coordinate 1 (Fig. 1c and 
Supplementary Fig. 2). However, within the non-US populations, 
separation between Malawians and Amerindians was also observed 
(along principal coordinate 3 in the case of adults; Supplementary Fig. 
2f). We did not find any significant clustering by village for Malawians 
and Amerindians or by region within the United States. Fourth, 
bacterial diversity increased with age in all three populations 
(Fig. 2a, b). The fecal microbiota of US adults was the least diverse 
compared with the two other populations (Fig. 2c, P< 0.005, analysis 
of variance (ANOVA) with Bonferroni post-hoc test): these differ- 
ences were evident in children older than 3 years of age (P < 0.005, 
ANOVA with Bonferroni post-hoc test), but not in younger subjects. 

We next used the non-parametric Spearman rank correlation to 
determine which bacterial taxa change monotonically with increasing 
age within and between the three sampled populations. We only 
considered children who were breastfed and used data sets obtained 
from the V4 region of the 16S rRNA gene as well as data sets of 
shotgun pyrosequencing reads from the fecal microbiomes of the 
110 sampled individuals (24 babies (0.6-5 months old), 60 children 
and adolescents (6 months to 17 years old) and 26 adults). Shotgun 
reads were mapped to 126 sequenced human gut-derived microbial 
species (Supplementary Table 4). The advantage of using these 126 
gut microbes as a reference database is that spurious hits of shotgun 
microbiome reads to taxa that are not present in the gut are minimized. 
Nonetheless, when we repeated the entire analysis, blasting against 
1,280 genomes in KEGG, the results were similar (Supplementary 
Fig. 3). Phylotypes belonging to Bifidobacterium longum exhibited a 
significant decline in proportional representation with increasing age 
in all three populations (Supplementary Fig. 3a). Most (75 + 20%) 
shotgun and 16S rRNA V4 sequences in all babies mapped to members 
of the Bifidobacterium genus. Bifidobacteria continued to dominate 
fecal communities throughout the first year of life, although their 
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Figure 2 | Bacterial diversity increases with age in each population. a-c, The 
number of observed OTUs sharing =97% nucleotide sequence identity plotted 
against age for all subjects (a), during the first 3 years of life (b), and adults 
(c). Mean + s.e.m. are shown in c. *P < 0.05, **P < 0.005 (ANOVA with 
Bonferroni post-hoc test). 


14 JUNE 2012 | VOL 486 | NATURE | 223 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


proportional representation diminished during this period, in agree- 
ment with the results of several studies of small numbers of children**’ 
(Supplementary Fig. 3). Supplementary Table 5 lists the species-level 
bacterial taxa whose representation changes significantly with age in all 
three populations, as well as species that change in a population- 
specific manner as defined from analysis of the shotgun sequencing 
data that were available from 110 of the 531 individuals. 

We also used Random Forests, a supervised machine-learning 
technique’’, and the V4 16S rRNA data sets obtained from 528 indi- 
viduals to identify bacterial species-level operational taxonomic units 
(OTUs) that differentiate fecal community composition in children 
and adults within and between the three populations. The purpose ofa 
classifier such as Random Forests is to learn a function that maps a set 
of input values or predictors (here, relative OTU abundances in a 
community) to a discrete output value (here, US versus non-US 
microbiota). Random Forests is a particularly powerful classifier that 
can exploit nonlinear relationships and complex dependencies 
between OTUs. The measure of the success of the method is its ability 
to classify unseen samples correctly, estimated by training it on a 
subset of samples, and using it to classify the remaining samples 
(cross-validation). The cross-validation error is compared with the 
baseline error that would be achieved by always guessing the most 
common category. As an added benefit, Random Forests assigns an 
importance score to each OTU by estimating the increase in error 
caused by removing that OTU from the set of predictors. In our 
analysis, we considered an OTU to be highly predictive if its import- 
ance score was at least 0.001; all error estimates and OTU importance 
scores were averaged over 100 rarefactions at the same sample size for 
each community (305,631 sequences) to control for sequencing effort. 

Random Forests analysis confirmed the dominance of Bifido- 
bacterium in the baby microbiota (Supplementary Table 6a). For 
adults, Random Forests revealed distinct community signatures for 
Western (US) and non-Western individuals (baseline error = 0.289, 
cross-validation error = 0.011 + 0.000). Of the 92 highly predictive 
species-level OTUs shown in Supplementary Table 6b, 73 were over- 
represented in non-US adults, and 23 out of the 73 were assigned to 
the Prevotella genus. Malawians and Amerindians could also be dis- 
tinguished from each other, although the difference was less extreme 
than the US versus non-US comparison (baseline error = 0.407, 
cross-validation error = 0.018 + 0.009, 56 highly predictive OTUs; 
Supplementary Table 6c). Only 28 OTUs distinguished US and 
non-US infants (Supplementary Table 6d). Intriguingly, three 
OTUs assigned to the Prevotella genus were overrepresented in the 
US infant microbiota, unlike the result observed in adults (Supplemen- 
tary Table 6d). Twenty-three OTUs discriminated Malawian and 
Amerindian baby microbiomes, 20 of which were overrepresented in 
the latter: most belonged to the Enterococcaceae family (Supplemen- 
tary Table 6d). Thus, a Western (US) lifestyle seems to affect the 
bacterial component of the gut microbiota systematically, although 
this influence is subtle compared with the high degree of variability 
observed in infants and children within each population (perhaps 
analogous to human genetic variability, in which variation among 
populations is small compared to variation within populations). 

Confirming the importance of Prevotella as a discriminatory taxon, a 
recent study also showed that this genus was present in higher abund- 
ance in the fecal microbiota of children living in West Africa (Burkina 
Faso) compared with children living in Europe (Italy). Furthermore, a 
member of this genus is one of three bacterial species that, in European 
adults, distinguishes strongly among three clusters, or enterotypes, of 
gut microbiota configurations that are claimed to be reproducible 
across Western adult populations’*. Therefore, we asked whether the 
fecal microbiota of infants and adults in each of our three geographically 
distinct populations fell into natural discrete clusters’’. We did not find 
strong evidence for discrete clustering (see Methods), but rather for 
variation driven in adults by a trade-off between Prevotella and 
Bacteroides (Supplementary Fig. 4a). Including infants introduces a 
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new, strongly supported gradient driven by Bifidobacteria, generally 
orthogonal to the Bacteroides/Prevotella trade-off (Supplementary 
Fig. 4b). Clustering of sub-populations of increasing minimum age 
indicates that adult cluster membership is generally consistent, but that 
children between 0.6 and 1 year of age may be clustered with adults or 
with younger children, depending on whether the younger children are 
included in the analysis (Supplementary Fig. 4c). 

This clustering analysis suggests that some features of normal vari- 
ation in the bacterial composition of the gut microbiota, such as the 
Prevotella/Bacteroides trade-off, are highly reproducible even in 
human population subsets of reduced variability. However, a com- 
plete description of variation in the human gut microbiota will require 
a substantially broader cross-cultural and cross-age sampling. 
Importantly, the observed age-related and geographic patterns were 
also detected with lower sequencing coverage (see Supplementary 
Results for a discussion of the influence of sequencing depth on the 
performance of Random Forests and PCoA analyses (Supplementary 
Figs 5 and 6), and our analysis of non-bacterial taxa that vary with age 
and population (Supplementary Fig. 7)). 


Shared functional changes over time 


Few studies have described changes in the gene content of the gut 
microbiome as a function of age: the largest study reported so far was 
carried out in 13 healthy Japanese individuals (5 children, the youngest 
3 months old, plus 8 adults)*. Our shotgun sequencing data set from 
110 individuals allowed us to characterize the representation of func- 
tional gene groups (KEGG KO annotations and EC numbers) in 
microbiomes representing broader age groups (youngest 3 weeks), 
and several distinct geographic locations and cultural traditions. We 
used Hellinger distance measurements to show that just as children are 
significantly more different from one another than are adults in terms 
of their fecal bacterial community phylogenetic structure, they are also 
more different in terms of their repertoires of microbiome-encoded 
functions, as defined by the proportional representation of EC and KO 
assignments. Moreover, as with UniFrac distances, Hellinger distances 
were greater between the US and the other two populations at all ages 
sampled (Supplementary Fig. 8). Of interest is the concordance of 
patterns of covariance between the two data types: Procrustes analysis 
disclosed that the goodness of fit was significant (P < 0.001 with 1,000 
iterations) whether UniFrac (the most appropriate metric for 16S 
rRNA data) or Hellinger distances (for consistency with the method 
used on the KEGG EC and KEGG Orthology data) were used to reduce 
the OTU table (Supplementary Fig. 9a, b and data not shown). 
Annotation of shotgun reads from the microbiomes using the 
Clusters of Orthologous Groups (COG) database produced similar 
concordance with 16S rRNA data sets (Supplementary Fig. 9c). 

When examining KEGG EC profiles across 110 fecal microbiomes, 
we obtained the remarkable result that there were no ECs identified as 
being unique to adults (n = 26) or babies (less than 6 months old, 
n = 24). Moreover, the total number of ECs found in adults was not 
significantly different than the total number of ECs scored in babies 
(sampling normalized to coverage in Supplementary Fig. 10a). This 
finding was robust to geography. The fraction of sequences with 
assignable KEGG EC annotations declined with increasing age in all 
three populations (Supplementary Fig. 10b). This may be due to the 
increased complexity of the adult microbiome, with fewer represent- 
ative species characterized by genome sequencing, genetic manipula- 
tion or biochemically (also see Supplementary Results and 
Supplementary Figs 11 and 12 for a comparison of our data set to a 
published data set of fecal microbiomes sampled from 124 adults 
living in Denmark and Spain’). 

We used ShotgunFunctionalizerR”’, a software tool designed for 
metagenomic analysis and based on a Poisson model, to identify 1,008 
ECs whose proportional representation in fecal microbiomes differed 
significantly between all sampled breastfed babies and all adults 
irrespective of their geographic location; 530 were significantly higher 
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in adults (P < 0.0001, Supplementary Table 7). A prominent example 
of these shared age-related changes involves the metabolism of 
vitamins B12 (cobalamin) and folate. In contrast to folate, which is 
synthesized by microbes and plants, cobalamin is primarily produced 
by microbes”'. The gut microbiomes of babies are enriched in genes 
involved in the de novo biosynthesis of folate, whereas those of adults 
have a significantly higher representation of genes that metabolize 
dietary folate and its reduced form tetrahydrofolate (THF; Sup- 
plementary Fig. 13 and Supplementary Table 7). Unlike de novo folate 
biosynthetic pathway components, which decrease with age, the pro- 
portional representation of genes encoding most enzymes involved in 
cobalamin biosynthesis increases with age (Supplementary Figs 14, 15 
and Supplementary Table 7). The folate and cobalamin pathways are 
linked functionally by methionine synthase (EC2.1.1.13), which cata- 
lyses the formation of THF from 5-methyl-THF and L-homocysteine, 
requiring cobalamin as a cofactor; the representation of this enzyme 
also increases with age (Supplementary Fig. 13). 

The low relative abundance of ECs involved in cobalamin bio- 
synthesis in the fecal microbiomes of babies correlates with the lower 
representation of members of Bacteroidetes, Firmicutes and Archaea 
in their microbiota (see Supplementary Fig. 16 for Spearman correla- 
tion coefficients). Although the biosynthetic pathway for cobalamin is 
well represented in the genomes of these organisms (Supplementary 
Fig. 16), Bifidobacterium, Streptococcus, Lactococcus and Lactobacillus, 
which dominate the baby gut microbiota (Supplementary Table 5 
and Supplementary Fig. 3), are deficient in these genes (Supplemen- 
tary Fig. 16). By contrast, several of these early gut colonizers contain 
ECs involved in folate biosynthesis and metabolism (Supplementary 
Fig. 16). The conventional view of the developing infant gut is that the 
main change is in the representation of Bifidobacteria. Although dif- 
ferences in the representation of Bifidobacteria contribute to this effect, 
differences in vitamin metabolism among the rest of the bacteria 
remain even when all Bifidobacteria reads were excluded (data not 
shown). These changes in vitamin biosynthetic pathway representa- 
tion in the microbiome correlate with published reports indicating that 
blood levels of folate decrease and cobalamin increase with age”. 

Besides cobalamin and folate, the relative abundance of ECs 
involved in the biosynthesis of vitamins B7 (biotin) (biotin synthase, 
EC2.8.16) and B1 (thiamine) (thiamine-phosphate diphosphorylase, 
EC2.5.1.3) are significantly higher in adult microbiomes than the 
microbiomes of babies (Supplementary Fig. 17 and Supplementary 
Table 7). Together, these findings suggest that the microbiota should 
be considered when assessing the nutritional needs of humans at 
various stages of development. 

Random Forests analysis asks a different statistical question from 
ShotgunFunctionalizeR: that is, which genes or species are most 
discriminatory among different class labels, rather than which are 
most over/underrepresented, and tends to identify fewer features than 
ShotgunFunctionalizeR when applied to the same data. Random 
Forests analysis yielded 107 ECs that best discriminate the adult 
and baby microbiomes (Supplementary Table 7). These predictive 
ECs were among the most significantly different ECs determined by 
ShotgunFunctionalizeR and included ECs involved in the metabolism 
of cobalamin and folate (Supplementary Table 7). In addition, 
Random Forests showed that ECs involved in fermentation, 
methanogenesis and the metabolism of arginine, glutamate, aspartate 
and lysine were higher in the adult microbiomes, whereas ECs 
involved in the metabolism of cysteine and fermentation pathways 
found in lactic acid bacteria (acetolactate decarboxylase (EC4.1.1.5) 
and 6-phosphogluconate dehydrogenase (EC1.1.1.44)) were mainly 
represented in baby microbiomes (Supplementary Fig. 17). 

Comparison of the representation of KEGG KOs between baby and 
adult microbiomes yielded essentially the same results as those 
reported with ECs. The only new finding was the overrepresentation 
of KEGG KOs assigned to a wide variety of ATP-binding cassette 
(ABC) transporters in baby microbiomes (Supplementary Table 7b). 
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Population- and age-specific differences 
ShotgunFunctionalizeR, Random Forests and Spearman rank correla- 
tion analyses were all used to compare EC representation in fecal 
microbiomes as a function of predefined categories of geographic 
location and age. A total of 476 ECs were identified as being signifi- 
cantly different in US versus Malawian and Amerindian breastfed 
babies (P < 0.0001, ShotgunFunctionalizeR; Supplementary Table 8). 
The most prominent differences involved pathways related to 
vitamin biosynthesis and carbohydrate metabolism. Malawian and 
Amerindian babies had higher representation of ECs that were com- 
ponents of the vitamin B2 (riboflavin) biosynthetic pathway (Fig. 3a 
and Supplementary Fig. 18). These differences were not evident in 
adults (Supplementary Table 7). Riboflavin is found in human milk 
and in meat and dairy products. We did not measure the levels of these 
vitamins in mothers and in their breastmilk in the sampled popula- 
tions, although it is tempting to speculate that the observed differences 
in baby microbiomes may represent an adaptive response to vitamin 
availability. 

Studies in gnotobiotic mouse models indicate that the ability of 
members of the microbiota to access host-derived glycans plays a 
key part in establishing a gut microbial community~*™*. As expected*”, 
compared with adults, baby microbiomes were enriched in ECs 
involved in the foraging of glycans represented in breastmilk and 
the intestinal mucosa (mannans, sialylated glycans, galactose and 
fucosyloligosaccharides; Supplementary Table 7). Several genes 
involved in using these host glycans are significantly overrepresented 
in Amerindian and Malawian baby microbiomes compared with US 
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Lipoyl(octanoyl) transferase EC2.3.1.181 

Threonine-phosphate decarboxylase EC4.1.1.81 
Adenosylcobinamide-phosphate synthase EC6.3.1.10 

Hydrogenobyrinic acid a,c-diamide synthase (glutamine-hydrolysing) EC6.3.5.9 
Adenosylcobyric acid synthase (glutamine-hydrolysing) EC6.3.5.10 
Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase EC2.4.2.21 
Choloylglycine hydrolase EC3.5.1.24 

Mercury(Il) reductase EC1.16.1.1 

Phenylacetate--CoA ligase EC6.2.1.30 


Figure 3 | Differences in the functional profiles of fecal microbiomes in the 
three study populations. Examples of KEGG ECs that showed the largest 
differences, as determined by Random Forests and ShotgunFunctionalizeR 
analyses, in proportional representation between US and Malawian/ 
Amerindian populations. Shown are the relative abundances of genes encoding 
the indicated ECs (normalized by Z-score across all data sets). a, UPGMA 
(unweighted pair group method with arithmetic mean) clustering of 10 US, 10 
Malawian and 6 Amerindian baby fecal microbiomes. b, UPGMA clustering of 
16 US, 5 Malawian and 5 Amerindian adult fecal microbiomes. 
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baby microbiomes, most notably exo--sialidase and «-L-fucosidase 
(Fig. 3a and Supplementary Table 8). These population-specific bio- 
markers may reflect differences in the glycan content of breastmilk. In 
fact, the representation of these glycoside hydrolases decreases as 
Malawian and Amerindian babies mature and transition to a diet 
dominated by maize, cassava and other plant-derived polysaccharides. 
By contrast, o-fucosidase gene representation in the US infants 
increases with age and as they become exposed to diets rich in readily 
absorbed sugars (Supplementary Fig. 19d and Supplementary Table 9). 

Another biomarker that distinguishes microbiomes based on age 
and geography is urease (EC3.5.1.5). Urease gene representation is 
significantly higher in Malawian and Amerindian baby microbiomes 
and decreases with age in these two populations, unlike in the United 
States, where it remains low from infancy to adulthood (Fig. 3a and 
Supplementary Fig. 19e). Urea comprises up to 15% of the nitrogen 
present in human breastmilk”. Urease releases ammonia that can be 
used for microbial biosynthesis of essential and nonessential amino 
acids*®”’. Furthermore, urease has a crucial involvement in nitrogen 
recycling, particularly when diets are deficient in protein**”’. Under 
conditions in which dietary nitrogen is limiting, the ability of the micro- 
biome to use urea would presumably be advantageous to both the 
microbial community and host. Although urease is generally attributed 
to Helicobacter and Proteus spp.; the relative abundance of members of 
these two genera was low (<0.05%) and not significantly different 
between the three populations. Urease activity has been characterized 
previously in Streptococcus thermophilus*®. Our analysis of shotgun 
reads that matched to the 126 reference gut genomes showed that 
the representation of five species that possess EC3.5.1.5 (Bacteroides 
cellulosilyticus WH2, Coprococcus comes, Roseburia intestinalis, 
Streptococcus infantarius and S. thermophilus) was significantly higher 
in Malawian and Amerindian baby microbiomes than in US baby 
microbiomes (Supplementary Table 5). 

Further support of the role of diet in shaping the infant gut micro- 
biome comes from the differences detected between breastfed and 
formula-fed babies who were part of the US infant twin cohort (see 
Supplementary Results and Supplementary Figs 2c, 8 and 20). 


Differences in adult fecal microbiomes 


Annotation of the shotgun sequencing data sets yielded a total of 
1,349 ECs in the 26 adults surveyed: ShotgunFunctionalizeR showed 
that the representation of genes encoding 893 of these ECs was 
significantly different in US versus Malawian/Amerindian fecal 
microbiomes (P< 0.005 after multiple comparison correction; 433 
overrepresented in US samples). By contrast, at this threshold only 
445 ECs were identified as different between Malawian and 
Amerindian adults (see Supplementary Table 10 for a complete list). 
The Random Forests classifier revealed 52 ECs that were best at dis- 
criminating US versus non-US adult fecal microbiomes. These ECs 
were also identified by ShotgunFunctionalizeR as the most signifi- 
cantly different (Supplementary Table 10). 

A typical US diet is rich in protein, whereas diets in Malawi and 
Amerindian populations are dominated by corn and cassava (Sup- 
plementary Table 1). The differences between US and Malawian/ 
Amerindian microbiomes can be related to these differences in diet. 
The ECs that were the most significantly enriched in US fecal micro- 
biomes parallel differences observed in carnivorous versus herbivorous 
mammals*’. ECs encoding glutamate synthase have higher propor- 
tional representation in Malawian and Amerindian adult microbiomes 
and are also higher in herbivorous mammalian microbiomes’' 
(Fig. 3b), whereas the degradation of glutamine was overrepresented 
in US as well as carnivorous mammalian microbiomes. Several ECs 
involved in the degradation of other amino acids were overrepresented 
in adult US fecal microbiomes: aspartate (EC4.1.1.12), proline 
(EC1.5.99.8), ornithine (EC2.6.1.13) and lysine (EC5.4.3.2) (Fig. 3b), 
as were ECs involved in the catabolism of simple sugars (glucose- 
6-phosphate dehydrogenase and 6-phosphofructokinase), sugar 
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substitutes (L-iditol 2-dehydrogenase, which degrades sorbitol), and 
host glycans (o-mannosidase, f-mannosidase and «-fucosidase; 
Fig. 3b). By contrast, a-amylase (EC3.2.1.1), which participates in 
the degradation of starch, was overrepresented in the Malawian and 
Amerindian microbiomes, reflecting their corn-rich diet. 

US microbiomes also had significant overrepresentation of ECs 
involved in vitamin biosynthesis (cobalamin (Fig. 3b and Supplemen- 
tary Fig. 15), biotin and lipoic acid (Fig. 3b)), in the metabolism of 
xenobiotics (phenylacetate CoA ligase (EC6.2.1.30), which participates 
in the metabolism of aromatic compounds, and mercury reductase 
(EC1.16.1.1)), and in bile salt metabolism (choloylglycine hydrolase 
(EC3.5.1.24), perhaps reflecting a diet richer in fats (Fig. 3b)). 


Effects of kinship on the microbiome across countries 

Differences in social structures may influence the extent of vertical 
transmission of the microbiota and the flow of microbes and micro- 
bial genes among members of a household. Differences in cultural 
traditions also affect food, exposure to pets and livestock, and many 
other factors that could influence how and from where a gut micro- 
biota/microbiome is acquired. We previously observed that adult 
monozygotic twins are no more similar to one another in terms of 
their gut bacterial community structure than are adult dizygotic 
twins’. This result suggests that the overall heritability of the micro- 
biome is low. We confirmed that the phylogenetic architecture of the 
fecal microbiota of monozygotic Malawian co-twins <3 years of age is 
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Figure 4 | Differences in the fecal microbiota between family members 
across the three populations studied. UniFrac distances between the fecal 
bacterial communities of family members were calculated (n = 19 Amerindian, 
34 Malawian families, and 54 US families with teenage twins). DZ, dizygotic; 
MZ, monozygotic. Mean and s.e.m. values are plotted. The UniFrac matrix was 
permuted 1,000 times; P values represent the fraction of times permuted 
differences were greater than real differences. m, months; NS (not significant; 
P> 0.05), *P < 0.05, **P < 0.005. 
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no more similar than the microbiota of similarly aged dizygotic co- 
twins (n = 15 monozygotic and 6 dizygotic twin pairs). We found that 
this is also true for monozygotic and dizygotic twin pairs aged 1-12 
months (n = 16 twin pairs), as well as teenaged twins (13-17 years 
old; n = 50 pairs) living together in the United States (Fig. 4). 

Although biological mothers are in a unique position to transmit an 
initial inoculum of microbes to their infants during and after birth, 
our analysis of mothers of teenage US twins showed that their fecal 
microbiota were no more similar to their children than were those of 
biological fathers, and that genetically unrelated but co-habiting 
mothers and fathers were significantly more similar to one another 
microbially than were members of different families (Fig. 4; note that 
no fathers were sampled in Malawi and only four fathers in the 
Amerindian cohort). These latter observations emphasize the import- 
ance of a history of numerous common environmental exposures in 
shaping gut microbial ecology. Moreover, the similarity in the overall 
pattern of the effects of kinship on microbial community structure 
suggests that despite the large influence of cultural factors on which 
microbes are present in both children and adults in each population, 
the bases for the degree of similarity among members of a family are 
consistent across the three populations studied. 


Prospectus 


Our results emphasize that it is essential to sample a broad population of 
healthy humans over time, both in terms of their age, geography and 
cultural traditions, to discover features of gut microbiomes that are 
unique to different locations and lifestyles. In addition, we need to 
understand how the pressures of westernization are changing the micro- 
bial parts of our genetic landscape—changes that potentially mediate the 
suite of pathophysiological states correlated with westernization. 
Finally, given the need for global policies about sustainable agriculture 
and improved nutrition, it will be important to understand how we can 
match these policies not only to our varying cultural traditions but also 
to our varied gut microbiomes. 


METHODS SUMMARY 


Sample collection. Subjects were recruited for the present study using procedures 
approved by Human Studies Committees from Washington University in St 
Louis, Children’s Hospital of Philadelphia, the University of Colorado, 
Boulder, the University of Malawi, the University of Puerto Rico, and the 
Venezuelan Institute for Scientific Research. Each fecal sample was frozen within 
30 min of donation. 

Multiplex DNA sequencing. Extracted genomic DNA was subjected to multiplex 
Illumina sequencing of the V4 region of bacterial 16S rRNA genes, as well as 
multiplex 454 pyrosequencing of total community DNA. See Methods for further 
details about the analysis. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Isolation of fecal DNA and multiplex sequencing. Each participant provided a 
fecal specimen that was frozen within 30 min. All samples were stored at —80 °C 
and subjected to a common protocol for DNA extraction. Fecal samples were 
pulverized with a mortar and pestle at —80 °C. Genomic DNA was extracted from 
400 mg aliquots of frozen pulverized samples. Methods for multiplex Illumina 
sequencing of V4 amplicons have been described”. 

For multiplex shotgun 454 Titanium FLX pyrosequencing, each fecal com- 

munity DNA sample was randomly fragmented by nebulization (500-800 base 
pairs) and then labelled with a distinct Multiplex IDentifier (MID; Roche) accord- 
ing the manufacturer’s protocol (Rapid Library preparation for FLX Titanium, 
Roche). Equivalent amounts of 12 MID-labelled samples were pooled before each 
pyrosequencer run. 
Data analysis. 16S rRNA OTUs were picked from the Illumina reads using a 
closed-reference OTU picking protocol against the Greengenes database 
clustered at 97% identity (that is, uclust_ref: sequences are clustered against a 
reference database, and reads that do not match the database are excluded from 
further analyses) with uclust using the QIIME suite of software tools'® version 
1.3.0-dev (pick_otus.py parameters: -max_accepts 1 -max_rejects 8 -stepwords 
8 -word_length 8). Of the 1,093,740,274 Illumina reads from the V4 region of 
bacterial 16S rRNA genes that passed the QIIME quality filters, 87.1% 
(952,115,802) matched a reference sequence at =97% nucleotide sequence 
identity. Taxonomy assignments were associated with OTUs based on the tax- 
onomy associated with the Greengenes reference sequence defining each OTU. 
UniFrac distances between samples were calculated using the Greengenes ref- 
erence tree. Greengenes reference sequences, trees and taxonomy data used in this 
analysis can be found at: http://greengenes.lbl.gov/Download/Sequence_Data/ 
Fasta_data_files/Reference_OTUs_for_Pipelines/Caporaso_Reference_OTUs/ 
gg_otus_4feb2011.tgz. 

A table of OTU counts per sample was generated and used in combination with 
the tree to calculate « and f diversity. To generate unweighted UniFrac distance 
matrices, all communities were rarefied to 290,609 V4 16S rRNA reads per 
sample. Unweighted UniFrac rather than weighted UniFrac was used for analyses 
owing to the large differences in taxonomic representation among the samples. 
Nonetheless, the patterns were similar with weighted UniFrac (data not shown). 
Rarefaction analysis was conducted using the QIIME scripts multiple_rarefaction.py, 
alpha_diversity.py and collate_alpha.py. The QUIME metric ‘observed species’ was 
used to estimate o diversity in the data set. 

Clustering analysis. Testing for discrete clusters was performed on the rarefied 
versions of the 16S rRNA OTU relative abundance tables. OTU counts were 
binned into genus-level taxonomic groups according to the taxonomic assign- 
ments described earlier. Several distance measures were considered, including 
Jensen-Shannon divergence, Bray—Curtis and weighted/unweighted UniFrac 
distances. Clustering was performed via partitioning around medoids in the R 
package cluster**. The choice of number of clusters and quality of the resulting 
clusters were assessed by maximizing the silhouette index*’. Traditionally, 
silhouette indices of 0.5 or above have been considered evidence of reasonable 
clustering structure. Although some silhouette scores above 0.5 were found in this 
data set (for example, for two clusters when clustering all adult populations with 
Jensen-Shannon divergence), reclustering within different subpopulations (for 
example, individual countries) introduced new cluster boundaries with silhouette 
scores still near or above 0.5, indicating that silhouette index scores may need to 
be substantially above 0.5 to claim clustering structure for microbial enterotype 
testing. We also tested for discrete clusters using the prediction strength mea- 
sure’’, which showed negative results for all distances measures but unweighted 
UniFrac (prediction strength = 0.963 + 0.012 (mean + s.d.)). We estimated the 
s.d. by tenfold jackknifing. 

Shotgun sequences from fecal microbiomes. Shotgun reads were filtered using 
custom Perl scripts and publicly available software to remove (1) all reads <60 
nucleotides; (2) Titanium pyrosequencing reads with two continuous and/or 
three total degenerate bases (N); (3) all duplicates (a known artefact of pyrose- 
quencing), defined as sequences in which the initial 20 nucleotides are identical 
and that share an overall identity of >97% throughout the length of the shortest 
read*’; and (4) all sequences with significant similarity to human reference 
genomes (Blastn E-value threshold = 10°, bitscore = 50, percentage identity = 
75%) to ensure the continued de-identification of samples. 


Searches against the database of 126 human gut bacterial genomes were con- 
ducted with Blastn. A sequence read was annotated as the best hit in the database 
if the E-value was <10~°, the bit score was =50, and the alignment was at least 
95% identical between query and subject. Relative abundances of reads mapped to 
each of the 126 genomes were adjusted to genome sizes. Searches against the 
protein-coding component of the KEGG database (v58) and against COG (v8.3) 
were conducted with BLASTX. (Note that when we performed searches against a 
separate KEGG database of intergenic regions alone, very few hits were observed.) 
Counts were normalized to the mapped reads. In total, 40 + 8% reads were 
mapped to KEGG KOs and 56 + 11% to COG; 44 + 16% of the reads mapped 
to the 126 gut genomes using 95% sequence similarity cut-off. Unmapped reads 
were excluded from the analyses shown in the main text, although repeating the 
analyses including these reads had little effect on the results. To quantify the 
differences in KEGG EC profiles among fecal microbiomes, evenly rarefied 
matrices of EC counts were created with all samples, and Hellinger distances 
were calculated using QUIME. 

Spearman rank correlations were carried out using the R statistical software**. 
To identify bacterial taxa that change with increasing age in each population, the 
proportion of reads that map to each of the 126 reference sequenced human gut 
genomes in each fecal microbiome was identified. The relative abundance of reads 
from each genome was then correlated with age (years) for each geographic 
region. To identify genes encoding ECs that change with age, the proportion of 
reads annotated with each EC in each fecal microbiome was identified. The 
relative abundance of each EC was subsequently correlated with age (years) for 
each geographic region. 

Random Forests analysis. Random Forests analysis was applied as described in 
ref. 8, using the randomForest package in R® with 500 trees and all default 
settings. The generalization error was estimated using fivefold cross-validation 
for all comparisons involving adults from the 16S rRNA data; leave-one-out 
cross-validation was used for all other comparisons. For each comparison, the 
relevant subset of samples was extracted from the table of OTU or EC counts, and 
all singleton OTUs/ECs (or all OTUs/ECs present in fewer than 5 samples for the 
16S rRNA comparisons involving adults) were subsequently removed. Random 
Forests analysis was performed for each comparison on 100 rarefied versions of 
the data, and the average cross-validation error estimates and OTU/EC import- 
ance estimates were reported. Rarefaction depths were chosen manually to 
include all samples without exceptionally low total sequences. The chosen depth 
for each comparison and the resulting number of samples are shown in 
Supplementary Tables 6-8 and Supplementary Fig. 6. For the analysis shown 
in Supplementary Fig. 6a, we compared the generalization errors obtained when 
using 16S rRNA-based OTUs from the Illumina V4 data sets at various sequen- 
cing depths. For direct evaluation of the predictive strength of the Illumina-based 
OTUs, we rarefied at the lowest observed depth of 305,631 sequences for each 
classification task, as well as at sequencing depths of 100, 1,000, 10,000, 100,000 
and 1,000,000 reads per sample. The mean and s.d. of the cross-validation error 
were estimated for each classification tasking using ten independent 
rarefactions of the data. We also included the expected ‘baseline’ error obtained 
by a classifier that simply predicts the most common class label. 

Data deposition. DNA sequences have been deposited in MG-RAST (http:// 
metagenomics.anl.gov/) under accession numbers ‘qiime:850’ for Illumina V4 
16S rRNA data sets, and ‘qiime:621’ for fecal microbiome shotgun sequencing 
data sets. 
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NPR3 and NPR4 are receptors for the 
immune signal salicylic acid in plants 


Zheng Qing Fu'*, Shunping Yan!*, Abdelaty Saleh'*, Wei Wang", James Ruble”, Nodoka Oka®, Rajinikanth Mohan’, 


Steven H. Spoel*, Yasuomi Tada®, Ning Zheng” & Xinnian Dong! 


Salicylic acid (SA) is a plant immune signal produced after pathogen challenge to induce systemic acquired resistance. It 
is the only major plant hormone for which the receptor has not been firmly identified. Systemic acquired resistance in 
Arabidopsis requires the transcription cofactor nonexpresser of PR genes 1 (NPR1), the degradation of which acts as a 
molecular switch. Here we show that the NPR1 paralogues NPR3 and NPR4 are SA receptors that bind SA with different 
affinities. NPR3 and NPR4 function as adaptors of the Cullin 3 ubiquitin E3 ligase to mediate NPR1 degradation in an 
SA-regulated manner. Accordingly, the Arabidopsis npr3 npr4 double mutant accumulates higher levels of NPR1, and is 
insensitive to induction of systemic acquired resistance. Moreover, this mutant is defective in pathogen 
effector-triggered programmed cell death and immunity. Our study reveals the mechanism of SA perception in 
determining cell death and survival in response to pathogen challenge. 


After pathogen challenge, host cells have to make a life-and-death 
decision to fend off infection. Recognition of a pathogen effector by 
a host resistance protein can lead to effector-triggered immunity 
(ETI), characterized by rapid programmed cell death (PCD) known 
as the hypersensitive response’. The clearly defined boundary of the 
hypersensitive response indicates the presence of a mechanism that 
controls cell death and survival. Despite intense studies of plant 
mutants defective in controlling the spread of PCD’, the regulatory 
mechanism still remains a mystery. 

Localized PCD can induce systemic acquired resistance (SAR) 
through the production of the immune signal, salicylic acid (SA). 
SA triggers global transcriptional reprogramming and resistance to 
a broad spectrum of pathogens. The receptor for SA has been sought 
after for many years, mainly through biochemical purification of SA- 
binding proteins*®. However, genetic data for these SA-binding 
proteins, which include a catalase, a chloroplast carbonic anhydrase, 
and a methyl SA esterase, suggest that none of them functions as a 
bona fide SA receptor. By contrast, genetic studies of SA-insensitive 
mutants have strongly suggested that NPR1, which contains a BIB 
(bric a brac, tramtrack, broad-complex) domain, an ankryin repeat 
domain and a nuclear localization sequence, is a potential SA receptor’. 
However, the NPR1 protein does not have considerable SA binding 
activity under different test conditions (Supplementary Fig. 2). 

Instead of direct binding, SA has been shown to control the nuclear 
translocation of NPR1 through cellular redox changes*. In the absence of 
pathogen challenge, NPRI is retained in the cytoplasm as an oligomer 
through redox-sensitive intermolecular disulphide bonds. After induc- 
tion, these disulphide bonds are reduced, releasing NPR1 monomers 
into the nucleus, where NPR1 acts as a cofactor for transcription factors, 
such as TGAs, to induce defence-related genes. In the absence of a 
functional NPRI protein, SA-induced transcriptional reprogramming 
is almost completely blocked. 

The presence of a BIB domain in NPRI1 suggests that, like other 
BTB domain-containing proteins, it may interact with Cullin 3 


(CUL3) E3 ligase and mediate substrate degradation’. However, our 
research led to the surprising finding that the NPRI protein itself is 
degraded by the proteasome. Although NPRI1 is degraded in the 
nucleus of resting cells to dampen basal expression of defence genes, 
it is phosphorylated after immune activation at an IkB-like phospho- 
degron motif, ubiquitinylated by a CUL3 E3 ligase, and degraded to 
sustain maximum levels of target gene expression probably through 
accelerated recycling of the transcription initiation complex"®. 
Blocking NPR1 degradation by mutating the IxB-like phophodegron 
in NPRI or the two CUL3 genes (cul3a cul3b) in Arabidopsis led to 
increased basal resistance, but insensitivity to SAR induction. 
Therefore, nuclear accumulation of NPR1 is needed for basal defence 
gene expression and resistance, whereas its subsequent turnover is 
required for establishing SAR. 


NPR3 and NPR4 are CUL3 adaptors for NPRI degradation 


In a search for the adaptor proteins of the CUL3 E3 ligase that spe- 
cifically target NPR1 for degradation, we considered its paralogues, 
NPR3 and NPR4, as possible candidates, because both contain the 
BTB domain as well as an extra protein-protein interaction domain 
(ankyrin repeat) (Supplementary Fig. 3), which are typical for CUL3 
substrate adaptors’. More importantly, despite their sequence 
similarities to NPR1, the npr3 npr4 double mutant has the opposite 
phenotype of npr1 in that it exhibits enhanced disease resistance"', a 
phenotype reminiscent of the cul3a cul3b mutant”. 

To test our hypothesis that NPR3 and NPR4 are CUL3 adaptors for 
NPRI degradation, we examined the accumulation of NPR1 protein 
in wild-type, npr3, npr4 and npr3 npr4 mutant plants. NPR1 protein 
levels were higher in the npr4 and npr3 npr4 mutants than in the wild 
type in the absence of exogenous SA, and increased faster in the npr3, 
npr4 and npr3 npr4 mutants compared with wild type in response to 
SA treatment (Fig. la). The effects of npr3 and npr4 on NPR1 were 
probably post-transcriptional, as NPR1 transcripts were not increased 
in these mutants (Supplementary Fig. 4). To prove our hypothesis 
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Figure 1 | NPR3 and NPR4 mediate degradation of NPR1. a, NPRI protein 
levels in wild type (WT), npr3, npr4 and npr3 npr4 (npr34) plants treated with 
0.5mM SA. The NPRI1 level (shown at bottom) was determined on the basis of 
the ratio of the NPR1 band intensity to that of the non-specific band (asterisk). 
b, GST-NPRI degradation in extracts from wild-type or npr3 npr4 double 
mutant (npr34), with or without (—) MG115 or with recombinant His-MBP- 
NPR3 and His-MBP-NPR& proteins (NPR3,4). ¢, In vitro pull-down assay of 
GST-CUL3A and NPR3-HA and NPR4-HA. d, Co-immunoprecipitation of 
NPRI-GFP and CUL3 in npr1 and npr1 npr3 npr4 (npr134) plants. 
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further, we performed in vitro degradation experiments using purified 
recombinant glutathione S-transferase (GST)-tagged NPRI1 protein. 
We found that after 15 min of incubation, the recombinant NPR1 
protein was degraded in the wild-type plant extract, but not in npr3 
npr4 (Fig. 1b). The addition of purified recombinant NPR3 and NPR4 
proteins tagged with histidine (His) and maltose binding protein 
(MBP) to the extract complemented the mutant phenotype, support- 
ing a role of NPR3 and NPR4 in mediating NPRI degradation. This 
degradation is probably through the proteasome, as application of the 
proteasome inhibitor MG115 stabilized the protein (Fig. 1b). To 
demonstrate further that NPR3 and NPR4 act as adaptors for the 
CUL3 E3 ligase, we first performed pull-down experiments using in 
vitro translated haemagglutinin (HA)-tagged NPR3 (NPR3-HA) and 
NPR4-HA. We found that CUL3A could pull down NPR3 and NPR4, 
with NPR4 showing a stronger interaction (Fig. 1c). Then we per- 
formed a co-immunoprecipitation assay using transgenic plants con- 
stitutively expressing NPR1-green fluorescent protein (GFP) in npr1 
and nprlnpr3npr4 mutants. We found that the amount of the 
endogenous CUL3 protein pulled down by NPRI-GFP was signifi- 
cantly reduced in the npr1 npr3 npr4 triple mutant compared with the 
npr1 single mutant (Fig. 1d), indicating that the NPRI-GFP inter- 
action with CUL3 requires NPR3 and NPR4. These results further 
support our hypothesis that NPR4 and NPR3 are CUL3 adaptors 
for the degradation of NPR1 before and after SA induction, respect- 
ively (Fig. la). 


SA affects NPRI-NPR3 and NPRI-NPR4 interactions 


Proteasome-mediated protein degradation has a crucial role in regu- 
lating plant hormone receptors’”. In some of these cases, the hormones 
act as a molecular glue to enable the formation of the receptor com- 
plex'*"*, which includes the substrate adaptor for the E3 ligase and the 
corresponding substrate. Our data show that proteasome-mediated 
degradation of NPRI1 is also involved in SA signalling'®, although a 
different E3 ligase (CUL3, instead of CUL1) is used. 

To test the possibility that SA is part of the NPRI-NPR3/4 complex, we 
performed a yeast two-hybrid (Y2H) assay. Using NPR3 as baitand NPR1 
as prey, little growth was observed on plates without SA (Fig. 2a). 
However, yeast growth was observed on plates supplemented with 
100 11M SA or with the functional analogue of SA 2,6-dichloroisonicotinic 
acid (INA)"*, but not on plates with 4-hydroxybenzoic acid (4-HBA)®, 
which cannot induce SAR. Interestingly, although SA promoted the 
NPRI-NPR3 interaction, it disrupted the interaction between NPR1 
and NPR4 (Fig. 2b). Moreover, NPR3 and NPR4 could form both 
homodimers and heterodimers with each other in the presence of 
SA and INA, but not 4-HBA (Fig. 2c). This suggests that NPR3 and 
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Figure 2 | SA directly regulates interactions between NPR proteins. 

a-c, Yeast two-hybrid assay to test interactions between NPR1 and NPR3 
(a), NPR1 and NPR4 (b), and NPR3 and NPR4 (c). Diploid yeast cells were 
spotted on plates (SD media lacking Trp, Leu and His, plus 3 mM 
3-aminotriazole) without (control) or with 100 uM SA, INA or 4-HBA. AD, 
activation domain; BD, DNA-binding domain. 1, NPR1; 3, NPR3; 4, NPR4. 
d, In vitro pull-down assays between His-MBP-NPRI1 and GST-NPR3 and 
GST-NPR4 in the presence or absence of 100 1M SA. 


NPR4 not only control NPRI stability, but also self-regulate. Because 
NPRI1 did not form homodimers with or without SA and interacted 
with NPR2 independently of SA (Supplementary Fig. 5), we focused on 
the regulatory roles of NPR3 and NPR4. 

To validate the Y2H data further, we performed in vitro pull-down 
assay. As shown in Fig. 2d, using the GST-NPR3 protein, we were able 
to pull down His-MBP-NPRI only in the presence of SA. By contrast, 
GST-NPR4 could pull down His-MBP-NPR1 only in the absence of 
SA, indicating that the NPRI-NPR4 interaction was disrupted by SA. 
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NPR3 and NPR4 bind SA with different affinities 

Both the Y2H and the in vitro pull-down results strongly suggest that 
SA directly binds to NPR3 and NPR4 to control their interactions with 
NPRI. To prove that NPR3 and NPR4 are SA receptors, we measured 
their SA-binding activities using [*H]-SA. We found that both GST- 
tagged NPR3 and NPR4 recombinant proteins bound [*H]-SA 
(Fig. 3a, b and Supplementary Fig. 2a). Next, we assessed whether 
active or inactive SA analogues could compete with [PH]-SA to bind to 
GST-NPR3 and GST—NPR4. The active SAR inducer 5-chlorosalicylic 
acid® and INA reduced the binding of PH]-SA to GST-NPR3 and 
GST-NPR4, whereas 4-HBA had little effect (Fig. 3a, b). To assess 
the binding affinity of NPR3 and NPR4, we performed saturation 
binding experiments. Whereas NPR4 had a classical saturation curve 
(Fig. 3c), NPR3 binding could not be saturated even with 1,000 nM 
(PH]-SA, indicating that NPR3 has a lower affinity than NPR4. 
Accordingly, the binding of SA to NPR3 was slower than NPR4 
(Supplementary Fig. 6). Next, we analysed the saturation binding data 
with GraphPad Prism using different models, and found that the 
model ‘one site—specific binding with Hill slope’ is significantly better 
than the other models, which indicates that there are several binding 
sites or fractions in NPR3 and NPR4. The dissociation constant (Kg) 
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Figure 3 | NPR3 and NPR4 bind SA. a, b, Competition binding assay of 
NPR4 (a) and NPR3 (b). c.p.m., counts per minute; CSA, 5-chlorosalicylic acid; 
NC, no competitor. c, Saturation binding assay of NPR4. Kg = 46.2 + 2.35 nM, 
h = 0.830 + 0.0314. d, Dissociation assay of NPR4. The dissociation was 
initiated by the addition of 1 mM non-radioactive-labelled SA (cold SA) or by 
infinite dilution. By and B, are total binding before and after dissociation, 
respectively. e, Homologous competitive binding assay of NPR3. 

ICso = 1,811 nM (log(ICs9) = 3.26 + 0.0901), h = 0.554 + 0.0612. f, Size 
exclusion chromatography showing that NPR4 tetramer binds SA (black). Top 
panel, elution profile. Green, red, purple and orange peaks correspond to 2,000, 
158, 75 and 44 kDa, respectively. Bottom panel, total binding of PH]-SA in 
different fractions. mAU, milli-absorbance units. Error bars represent s.d. 

(n = 2 or 3). 
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for NPR4 was 46.2 + 2.35nM (mean + s.e.m.) with a Hill coefficient 
(h) of 0.830 + 0.0314. To check the cooperativity of different binding 
sites, we carried out dissociation experiments by the addition of 1mM 
non-radioactive-labelled SA (cold SA) or by infinite dilution. The dis- 
sociation curves (Fig. 3d) indicate that NPR4 has several SA-binding 
sites, and the lack of overlap between the two curves suggests negative 
cooperativity between these binding sites (the first binding reduces the 
affinity for subsequent binding). The Kg value for NPR3 (981 nM, 
Supplementary Fig. 7) was significantly higher than 100 nM, which 
made saturation binding an inappropriate way to calculate the Kq. 
Therefore, we performed a homologous competitive binding assay 
(Fig. 3e). The half-maximum inhibitory concentration (ICs9) was cal- 
culated to be 1,811nM (log(ICs9) = 3.26 + 0.0901) with a Hill co- 
efficient of 0.554 + 0.0612. Through these analyses, we demonstrated 
that NPR3 and NPR4 bind SA specifically and with different affinities. 
To examine the receptor complex further, we performed gel filtra- 
tion analysis on the purified recombinant NPR4 protein—the receptor 
with the higher affinity to SA. Because the recombinant NPR4 protein 
spontaneously oligomerized in vitro in the absence of a reducing agent, 
our analysis focused on samples pretreated with 100 mM dithiothreitol 
(DTT) followed by dialysis against 5 mM DTT. We discovered that 
NPR4 was present in an estimated tetrameric form, which was com- 
petent in binding to SA (Fig. 3f). Notably, SA binding did not change 
the gel filtration elution profile of the protein. Further experiments are 
required to investigate how SA affects the receptor complexes to make 
them either more accessible (that is, NPR3 binding to NPR1) or less 
accessible (that is, NPR4 binding to NPR1) for substrate binding. 


The npr3 npr4 double mutant is defective in SAR and ETI 


To understand the biological significance of NPR3/4-mediated degra- 
dation of NPR1, a positive regulator of SAR, we first performed SAR 
tests in the npr3, npr4 and npr3 npr4 mutants using Pseudomonas 
syringae pv. maculicola ES4326 (Psm ES4326). Consistent with a pre- 
vious report"’, there was a significant reduction in Psm ES4326 growth 
in the npr3 npr4 double mutant without SAR induction (Fig. 4a). 
However, even after SAR induction by local inoculation of avirulent 
Psm ES4326/avrRpt2, no further reduction in growth of virulent Psm 
ES4326 in systemic tissue was observed in the npr3npr4 double 
mutant. To a lesser degree, SAR was also defective in the npr3 single 
mutant. Thus, stabilization of NPRI protein in the npr3 and npr3 npr4 
mutants rendered these plants insensitive to SAR induction. This 
phenotype is similar to that observed in the cul3acul3b double 
mutant”, validating the role of NPR3 and NPR4 in CUL3-mediated 
degradation of NPRI1 and SAR. 

On the basis of our knowledge that SAR and ETI are two distinct 
defence strategies, with the former promoting cell survival and the 
latter triggering PCD, we then tested the npr mutants for ETI using 
Pseudomonas strains expressing different effectors. Surprisingly, we 
found that the npr3 npr4 mutant failed to undergo PCD (Fig. 4b and 
Supplementary Fig. 8a) as quantified by ion leakage (Fig. 4c), and was 
compromised in resistance triggered by the effectors (Fig. 4d). The 
same phenotypes were observed in different mutant alleles of npr3, 
npr4 and npr3npr4 (Supplementary Fig. 8b). The ETI deficiency 
observed in npr3 npr4 is probably caused by the increased accumula- 
tion of NPRI, because this phenotype was suppressed in the npr1 npr3 
npr4 triple mutant. 

To observe NPRI turnover in response to pathogen challenge in situ, 
we inoculated Psm ES4326/avrRpt2 in the 35S:NPR1(C82A)-GFP 
transgenic plant, in which NPR1 is constitutively localized in the nuc- 
leus (Fig. 4e)'®. Eleven hours after inoculation, some cells showed 
increased chlorophyll leakage (Fig. 4f, red) with overlapping accumula- 
tion of phenolic compounds (Fig. 4e, larger green spots) indicative of 
PCD (Fig. 4g, yellow), whereas other cells were still intact. The 
NPRI1(C82A)-GFP fluorescence was markedly reduced inside the 
inoculated region (Fig. 4g, h). Notably, the NPR1(C82A)-GFP fluor- 
escence level was the highest in the cells surrounding the hypersensitive 
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Figure 4| SA receptors control NPRI stability to regulate SAR and ETI. 

a, SAR test in wild-type, npr3, npr4, npr3 npr4 (npr34), npr1 npr3 npr4 (npr134) 
and nprl1. c.f-u., colony forming units. b-d, ETI test in different mutants using 
Psm ES4326/avrRpt2. b, The hypersensitive response phenotype, 2 and 3 days 
post inoculation (d.p.i.). c, Ion leakage measurement. rps2, an avrRpt2-insensitive 
mutant. Error bars represent s.d., n = 4. d, Growth of Psm ES4326/avrRpt2. Error 
bars in a and d represent 95% confidence intervals, n = 6-8. *P < 0.05, 

***P < 0.001. e-g, Close-up images of an infection site by Psm ES4326/avrRpt2. 
Arrows point to intact cells inside the inoculated area. h, Image of the whole 
infection site showing high NPR1(C82A)-GFP accumulation surrounding the 
PCD zone. The rectangle shows the area from which the close-up images in 
e-g were taken. Yellow staining in g and h indicates dead cells, green indicates 
NPRI1(C82A)-GEP. Original magnification, 10 (e-g) and <2 (h). 


response lesion (Fig. 4h), consistent with the genetic data suggesting 
that NPR1 is an inhibitor of PCD during ETI. 


Discussion 

Through this study, we identified the NPR1 paralogues NPR3 and 
NPR4 as receptors for the immune signal SA. These receptors have 
different binding affinities to SA (Fig. 3), suggesting that they may be 
differentially responsive to spatiotemporal changes in cellular SA 
concentrations. SA controls accessibility of the CUL3 ligase adaptors 
NPR3 and NPR4 to their substrate NPR1 (Fig. 2), thereby regulating 
NPRI stability and activity (Figs 1 and 4). 

On the basis of our findings, we present a working model for the 
regulation of NPR1 by NPR3 and NPR¢ in response to different SA 
levels (Supplementary Fig. 1). In the absence of pathogen challenge, 
NPR4 constantly removes most of the NPR1 protein by CUL3-NPR4- 
mediated degradation. This degradation is important to prevent 
spurious activation of resistance. However, basal SA is required to 
disrupt some of the NPR1-NPR4 interactions to maintain the basal 
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level of NPR1. This is crucial because SA-deficient plants, eds5 
(ref. 17), ics] (also known as eds16)'* and the nahG transgenic line 
expressing an SA-degrading enzyme”, are impaired in maintaining 
NPRI1 homeostasis (Supplementary Fig. 9), resulting in enhanced 
disease susceptibility (Supplementary Fig. la, b). After challenge by 
pathogens that trigger ETI, SA levels increase both locally and sys- 
temically” to form a concentration gradient from the infection 
site*!”’, Previous research has shown that high levels of SA facilitate 
PCD**, and the spread of PCD may be controlled by the activities of 
proteins such as LSD1 (a zinc-finger protein) and Atrboh D (an 
NADPH oxidase). Because the npr3npr4 double mutant can no 
longer undergo PCD in response to pathogen effectors, we propose 
that NPR1, which over-accumulates in the npr3 npr4 mutant, can act 
as a negative regulator of PCD. Our finding is in line with a previous 
report suggesting that NPR1 suppresses hypersensitive response”. In 
support of this function of NPR1, the NPR1-GFP signal is the lowest 
inside the developing hypersensitive response (Fig. 4e-h) owing to 
CUL3-NPR3-mediated degradation of NPR1 (Supplementary Fig. 1c). 
In neighbouring cells, the lower SA level limits NPRI-NPR3 inter- 
action, enabling NPRI to accumulate in the margin of the hyper- 
sensitive response to restrict the spread of PCD and establish SAR 
(Supplementary Fig. 1d). 


METHODS SUMMARY 


Protein analysis was carried out as previously described’. Total NPR1 protein 
was detected by an antibody against NPR1. For the in vitro degradation assay, the 
purified recombinant GST-NPR1 was incubated with plant extracts and detected 
by an anti-GST antibody (GE Healthcare). The in vitro pull-down assays for 
CUL3A, NPR3 and NPR4 were performed using purified recombinant GST- 
CUL3A and in vitro translated NPR3-HA and NPR4-HA, which were detected 
by an anti-HA antibody (GenScript). Purified recombinant GST-NPR3 and 
GST-NPR4 proteins retained on the glutathione agarose beads were used to pull 
down purified His-MBP-NPR1. Bound His-MBP-NPRI was detected by western 
blot using an anti-His antibody (GenScript). For co-immunoprecipitation, the 
immunoprecipitation was performed using an anti-GFP antibody (Abcam) and 
the western blot using an anti-CUL3A antibody and an anti-GFP antibody 
(Clontech). Y2H assays were carried out using the Matchmaker system 
(Clontech). The interactions were determined by yeast growth on selective medium 
(SD media lacking Trp, Leu and His, plus 3 mM 3-aminotriazole) with or without 
100 1M SA, INA or 4-HBA. The SA-binding assays were performed as described® 
with modifications using purified recombinant GST-NPR3 or GST-NPR4 and 
(PH]-SA (American Radiolabelled Chemicals). Pathogen infection and ion leakage 
assay using Psm ES4326 with or without avrRpt2 were carried out as previously 
described’? 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Arabidopsis thaliana mutants and transgenic lines. Arabidopsis thaliana 
mutants (in ecotype Col-0) npr3-1, npr4-3, npr3-1npr4-3, npr1-1 npr3-1 

npr4-3, npr3-2 (SALK_043055), npr4-2 (SALK_098460) and npr3-2 npr4-2 were 
provided by Y. Zhang’. 35S:NPR1-GFP was introduced into the npr1-2 

npr3-1 npr4-3 background by crossing the 35S:NPR1-GFP transgenic plants (in 
npr1-2) with the npr3-1npr4-3 plants. Homozygous plants were selected by 
genotyping. 

Co-immunoprecipitation assay. Three-week-old plant sample was collected and 
ground in liquid N>. Protein was extracted in the extraction buffer (50 mM Tris- 
HCl, pH 7.5, 150 mM NaCl, 5 mM EDTA, 0.1% Triton X-100, 0.2% Nonidet P-40, 
and inhibitors: 50 pgml~! -N-tosyl-L-phenylalanyl chloromethyl ketone 
(TPCK), 50 ug ml! -N-tosyl-L-leucine chloromethyl ketone (TLCK), 0.6 mM 
phenylmethylsulphonyl fluoride (PMSF) and 40 uM MG115). The extracts were 
then pre-cleared with 50 11 of Dynabeads Protein G (Invitrogen). After 1 pl of 
anti-GFP antibody (Abcam) was added to the extracts and incubated for 2 h, 50 ul 
of magnetic Dynabeads was added to the samples and incubated for another hour 
with gentle rocking. The magnetic Dynabeads were then washed three times using 
the protein extraction buffer, and bound proteins were eluted by heating the 
magnetic beads in the SDS sample buffer containing 100 mM DTT at 95 °C for 
10 min. The NPRI-GFP and CUL3 proteins were detected by western blotting 
using an anti-GFP antibody (Clontech) and an anti-CUL3A antibody”, 
respectively’®. 

Pathogen infection. To test for the hypersensitive response, the avirulent 
pathogen Psm ES4326 carrying avrRpt2 (attenuance (D)¢o0nm = 0.02) or 
avrRpm1 (Deoonm = 0.1) and Pst carrying avrRps4 (Deéoonm = 0.1) or avrRpt2 
(D6oonm = 9.02) were infiltrated into 3—4-week-old leaves. Cell death was 
recorded 2-3 days after the infiltration. Ion leakage was recorded over time as 
described”’. To test for SAR, two lower leaves of 3-week-old plants were pressure- 
infiltrated with 10mM MgCl, or avirulent bacterial pathogen Psm ES4326 
carrying avrRpt2 (Déoonm = 0.02). Three days later, virulent bacterial pathogen 
Psm ES4326 (Deo0nm = 0.001) was infiltrated into two upper leaves. Disease 
symptoms were monitored and bacterial growth was analysed 3 days after the 
inoculation”®. 

Imaging of NPR1-GFP in infection site. The 35S: NPR1(C82A)-GFP plants were 
inoculated with Psm ES4326 carrying avrRpt2 (Déo0nm = 0.02) and incubated for 
11h. Leaf tissues were mounted in 10% glycerol and viewed with a BIOREVO 
(Keyence) BZ-9000 fluorescence microscope. The GFP signal is monitored with 
an excitation wavelength of 472.5 nm and a bandpass 502.5 to 537.5 nm emission 
filter. Red chlorophylls were viewed with an excitation wavelength of 540 nm and 
a bandpass 573 to 613 nm emission filter. To obtain wide-field view (2 x 5 
pictures), image stitching was performed by BZ-II Image Analysis Application. 
Experiments were repeated eight times. 

Molecular cloning of NPRs. The coding regions of NPRI, 2, 3 and 4 were 
amplified with PrimeSTAR HS DNA polymerase (Takara) using specific primers 
containing Gateway attB sites (Supplementary Table 1), and then cloned into the 
pDONR207 entry vector using the BP Clonase (Invitrogen) to create the NPR 
entry clones. After verification by sequencing, each of the clones was mobilized 
using the LR Clonase (Invitrogen) into the Gateway destination vectors pDEST- 
GBKT7 and pDEST-GADT7 for yeast transformation”, the protein expression 
vectors pDEST15 (Invitrogen) and pDEST-HisMBP (Addgene plasmid 11085)” 
for making the GST and His-MBP fusions, respectively. 

Detection of the NPRI protein. Four-week-old plants were sprayed with 0.5 mM 
SA and collected at different time points. Total protein was extracted in a buffer 
containing 50 mM Tris-HCl, pH7.5, 150 mM NaCl, 5mM EDTA, 0.1% Triton 
X-100, 0.2% Nonidet P-40, and inhibitors: 50 1g ml’ TPCK, 50 ug ml ' TLCK, 
0.6 mM PMSF and 40 1M MG115 (ref. 30). The homogenates were centrifuged 
twice at 13,500g for 15 min each. Protein was denatured in the SDS sample buffer 
containing 100mM DTT at 75°C for 10 min, and western blot analysis was 
performed using an antibody against NPRI (ref. 16). 

Quantitative real-time PCR. Total RNA was extracted from 4-week-old control 
and SA-treated plants at the indicated time points using TRIzol Reagent 
(Invitrogen). Genomic DNA was eliminated by treatment of the RNA with 2 U 
of TURBO DNA-free (Ambion). cDNA was synthesized using the Superscript III 
Reverse Transcription kit (Invitrogen) and analysed by quantitative real-time 
PCR using the FastStart Universal SYBR Green Master (Rox) kit (Roche) with 
gene-specific primers for NPR1 and ubiquitin 5 (Supplementary Table 1). 
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In vitro degradation assay. The NPRI1 degradation assay was performed as 
described’’. Leaves from wild-type or npr3-1 npr4-3 double mutant plants were 
ground in liquid N, and resuspended in the proteolysis buffer containing 25 mM 
Tris-HCl, pH7.5, 10 mM MgCl, 10mM NaCl, 10mM ATP and 5mM DTT. 
After centrifugation, the supernatants were mixed with the GST-NPR1 protein 
purified from Escherichia coli and incubated at room temperature for 15 min. The 
reactions were stopped by adding the SDS sample buffer containing 100 mM DTT 
and incubated at 75 °C for 10 min. The level of GST-NPRI protein was analysed 
by western blotting using an anti-GST antibody (GE Healthcare). 

In vitro pull-down assay. The coding sequence of CUL3A was cloned into the 
GST vector pGEX4T-2 (GE Healthcare) for expression in E. coli BL21(DE3). The 
coding sequences of NPR3 and NPR4 were cloned into pCMX-PL2 for in vitro 
translation using TNT Quick Coupled Transcription/Translation System 
(Promega) to produce the HA-tagged NPR3 and NPR4 proteins. The purified 
GST-CUL3A protein was bound to the glutathione agarose beads, incubated with 
the HA-tagged NPR3 or NPR4 protein, and washed three times with EB buffer 
(50mM Tris-HCl, pH7.2, 100mM NaCl, 1mM EDTA, 1mM EGTA, 1% 
dimethylsulphoxide, 20mM DTT and 0.1% NP40). The HA-tagged NPR3 or 
NPR4 protein bound to GST-CUL3A protein was detected by western blot ana- 
lysis using an anti-HA antibody (GenScript). Recombinant His-MBP-tagged 
NPRI and GST-tagged NPR3 and NPR4 proteins were produced in E. coli. The 
recombinant His-MBP-tagged NPR1 was purified using the Ni-NTA resin 
(QIAGEN). GST-tagged NPR3 and NPR4 proteins were purified using 
glutathione beads and retained on the beads to pull down purified His-MBP- 
NPRI protein with or without 100 1M SA ina buffer containing 50 mM Tris-HCl, 
pH6.8, 100mM NaCl and 0.1% NP40. After washing three times, bound His- 
MBP-NPRI was eluted by heating the glutathione beads at 95 °C for 10 min in the 
SDS buffer with 100 mM DTT and detected by western blot analysis using an anti- 
His antibody (GenScript). Equal loadings were verified by staining the membrane 
with Ponceau S. 

Yeast two-hybrid assay. The Saccharomyces cerevisiae yeast strains AH109 and 
Y187 were transformed with pGADT7-NPRI, 2, 3, 4and pGBKT7-NPRI, 2, 3, 4, 
respectively, according to the Clontech yeast transformation protocol. Yeast 
strains were grown on SD—Trp—Leu plates, and then fresh single colonies were 
grown for 1 day in the SD—Trp—Leu liquid media. Interactions between bait and 
prey were detected on the selective media: SD—Trp—Leu—His (control), 
SD—Trp—Leu—His with 100M sodium salicylate (SA), 100}.M INA, or 
100 uM 4-HBA. All of the SD—Trp—Leu—His selective media also contained 
3 mM 3-aminotriazole. 

SA-binding assay. The SA-binding assay was performed as described® with 
modifications. The GST-NPR3 and GST-NPR4 proteins were expressed in 
E. coli C41 and purified using Pierce Glutathione Magnetic Beads (Thermo). 
The protein-bound beads were incubated in 100 ul buffer containing 30 mM 
sodium citrate, pH6.3, 1mM EDTA and PH]-SA (American Radiolabelled 
Chemicals, specific activity 30 Ci mmol ~ !) The beads were washed twice, resus- 
pended in 100 pil H,O, mixed with 6 ml Ultima Gold Cocktails (PerkinElmer) and 
counted using the LC6000SC liquid scintillation counter (Beckman Instruments). 
The non-specific binding was determined in the presence of 1 mM unlabelled SA. 
The data were analysed using GraphPad Prism 5. 

Gel filtration analyses of NPR4. NPR4 was overexpressed as a GST-fusion 
protein in insect cells and purified by glutathione affinity chromatography in 
the presence of 100mM DTT. After TEV cleavage, NPR4 was further purified 
by anion exchange chromatography and dialysed against a buffer containing 
20 mM Tris-HCl, pH 8.0, 200 mM NaCl and 5mM DTT. After concentration, 
0.5-1 mg NPR4 was analysed in the same buffer with and without 10 11M SA as 
indicated by size exclusion chromatography on a Supderdex 200 gel filtration 
column. Co-elution of SA and NPR4 was monitored by [°H]-SA, which was pre- 
mixed with the NPR4 protein before injection. 
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The Hubble Deep Field provides one of the deepest multiwave- 
length views of the distant Universe and has led to the detection of 
thousands of galaxies seen throughout cosmic time’. An early map 
of the Hubble Deep Field at a wavelength of 850 micrometres, which 
is sensitive to dust emission powered by star formation, revealed the 
brightest source in the field, dubbed HDF 850.1 (ref. 2). For more 
than a decade, and despite significant efforts, no counterpart was 
found at shorter wavelengths, and it was not possible to determine 
its redshift, size or mass*’. Here we report a redshift of z = 5.183 for 
HDF 850.1, from a millimetre-wave molecular line scan. This places 
HDF 850.1 in a galaxy overdensity at z~ 5.2, corresponding to a 
cosmic age of only 1.1 billion years after the Big Bang. This redshift 
is significantly higher than earlier estimates**** and higher than 
those of most of the hundreds of submillimetre-bright galaxies 
identified so far. The source has a star-formation rate of 850 solar 
masses per year and is spatially resolved on scales of 5 kiloparsecs, 
with an implied dynamical mass of about 1.3 x 10"' solar masses, a 
significant fraction of which is present in the form of molecular gas. 
Despite our accurate determination of redshift and position, a 
counterpart emitting starlight remains elusive. 

We have obtained a full-frequency scan of the 3-mm band towards 
the Hubble Deep Field using the IRAM (Institut de Radioastronomie 
Millimétrique) Plateau de Bure Interferometer. The observations covered 
the frequency range from 80-115 GHz in ten frequency settings at 
uniform sensitivity and at a resolution (about 2.3’’) that is a good 
match to galaxy sizes at high redshift. They resulted in the detection 
of two lines of carbon monoxide (CO), the most common tracer for 
molecular gas at high redshift’, at 93.20 GHz and 111.84 GHz at the 
position of HDF 850.1. Identifying these lines with the J = 5 and J = 6 
rotational transitions of CO gives a redshift for HDF 850.1 of z = 5.183. 
This redshift was then unambiguously confirmed by the Plateau de 
Bure Interferometer’s detection of the 158-1m line of ionized carbon 
([Cu], redshifted to 307.38 GHz), one of the main cooling lines of the 
star-forming interstellar medium. Stacking of other molecules covered 
by our frequency scan that trace higher volume densities did not lead to 
a detection (see Supplementary Information). Subsequently, the J = 2 
line of CO has also been detected using the National Radio Astronomy 
Observatory (NRAO) Jansky Very Large Array at 37.29 GHz. The 
observed [C 1] and CO spectra towards HDF 850.1 are shown in Fig. 1. 

The beam size of our CO observations (about 2.3'’, 15 kpc at 
z= 5.183) is too large to spatially resolve the molecular gas emis- 
sion in HDF 850.1. However, the [C1] and underlying continuum 


observations (around 1.2'’ X 0.8’’) show that the source is extended 
(hitherto, the interstellar medium has been spatially resolved only in 
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Figure 1 | Detection of four lines tracing the star-forming interstellar 
medium in HDF 850.1. a, [C1], vo,, = 307.383 GHz. b, CO(6-5), 

Vobs = 111.835 GHz. c, CO(5-4), Vong = 93.202 GHz. d, CO(2-1), 

Vops = 37.286 GHz. Zero velocity corresponds to a redshift of z = 5.183. 
Continuum emission is detected in a and b at 6.80 + 0.8 mJy and 

0.13 + 0.03 mJy, respectively. We derive a 3o continuum limit of 30 py from 
the Jansky Very Large Array observations at 37.3 GHz using a bandwidth larger 
than shown here. Gaussian fits to the lines give a full width at half maximum 
(FWHM) of 400 + 30kms° ', narrower than typically found in sub-millimetre 
selected galaxies'’. The observed integrated line flux densities are: 

Siew = 14.6 £0.3Jykms |, Stcow-sy) = 0.39 + 0.1Jykms |, Stcocs-4)] = 
0.50 + 0.1Jykms ~ and Sjcov2-1)) = 0-17 + 0.04 Jy kms’. The resulting line 
luminosities are? 5.0 X 10!°Kkms ! pe, 1.0 10°Kkms pe’, 

1.9 10'°Kkms | pc’and 4.1 X 10!°Kkms ! pc? or 1.10 X 10! °Lguns 

1.06 X 10°Lguny 1.14 X 10°Lsun and 1.5 X 10’Lsun (uncertainties as given for 
integrated line flux densities). Large velocity gradient modelling gives a 
predicted CO(1-0) line luminosity of 4.3 X 10'°Kkms | pc”. 
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extremely rare quasar host galaxies at such high redshifts'®). A single 
Gaussian fit yields a deconvolved size of 0.9 + 0.3’’, or 5.7 + 1.9 kpc, at 
the redshift of the source. Figure 2 shows the maps of total [Cu] 
emission (Fig. 2a) as well as the red- and blue-shifted parts of the 
[Cu] line (Fig. 2b) superposed on the deepest available Hubble 
Space Telescope images of the Hubble Deep Field’. The derived 
dynamical mass is Mayn ~ 1.3 + 0.4 X 10) Meuns assuming an arbit- 
rary inclination of 30°. An alternative interpretation is that the source 
is a merger of two galaxies, rather than a single rotating disk, which 
would lower the implied dynamical mass. Figure 2 shows that the 
source is completely obscured in the observed optical and near-infra- 
red wavebands (that is, the rest-frame ultraviolet). There is no indica- 
tion of HDF 850.1 harbouring an active galactic nucleus powered by a 
supermassive black hole (quasar)"’. 

The CO(6-5)/CO(2-1) line luminosity ratio (in units of 
Kkms ! pe’) (ref. 9) is 0.23 + 0.05. Assuming that the gas is being 
emitted from the same volume, this implies that the high-J CO 
emission is sub-thermally excited on galactic scales, less than seen in 
the nuclei of local starburst galaxies’. Using a standard large velocity 
gradient model we find that the observed CO line intensities can be 
fitted with a moderate molecular hydrogen density of 10°? cm * anda 
kinetic temperature of 45K for virialized clouds (velocity gradient 
dv/dr=1.2kms ‘pe '). We caution that these numbers would 
change if the CO transitions were not emitted from the same volume. 
The predicted CO(1-0) line luminosity is 4.3 X 10'° Kkm s"' pc’, close 
to the measured value for CO(2-1). Depending on the choice of «, the 
CO-to-H, conversion factor, this line luminosity implies a molecular 
gas mass of My = 3.5 X («/0.8) X 10'°Mgun; here « = 0.8, in units 
of Mgun (K km s"! pe’), is the conversion factor adopted for ultra- 
luminous infrared galaxies (ULIRGs)"* and thought to be applicable 
to sub-millimetre bright objects'*. The implied molecular gas mass 
fraction is My2/Mayn ~ 0.25 + 0.08 («/0.8); that is, even with a low 
ULIRG conversion factor the molecular gas constitutes a significant 
fraction of the overall dynamical mass. This molecular gas mass 
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Figure 2 | [C11] line emission towards HDF 850.1. a, [C11] contours on top of 
a deep Hubble Space Telescope image’ of the region in a filter (I band) that 
covers the Lyman-¢ line and ultraviolet continuum at z = 5.183. [Cm] contours 
show the averaged emission over 700kms_' and are plotted at 5 mJy per beam, 
7 mJy per beam, 9 mJy per beam and 11 mJy per beam (1o = 1.3 mJy per beam). 
A Gaussian fit to the emission gives a deconvolved size of 0.9 + 0.3'’ or 

5.7 + 1.9 kpc at z = 5.183. The underlying continuum emission (not shown) is 
also extended on the same scales. b, The blue and red contours indicate the 
approaching and receding [C11] emission relative to the systemic redshift of 
z= 5.183. The colour shows a deep Hubble Space Telescope image in a longer 
wavelength filter (the J band from the Hubble Space Telescope’s near-infrared 
camera and multi-object spectrometer (NICMOS))”’. The cross indicates the 
position (and its 5o uncertainty) of the rest-frame 158-j1m continuum emission 
peak (right ascension 12 h 36 min 51.976 s, declination 62° 12’ 25.80’’ in the 
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(and fraction) is comparable to what is found in other sub- 
millimetre bright galaxies that are typically located at much lower 
redshift'*”’. 

The line-free channels of the observations (Fig. 1) were used to con- 
strain the underlying continuum emission. Our accurate position of the 
rest-frame 158 [im emission is indicated as a cross in Fig. 2 (right). We 
combine our continuum detections at 307 GHz and 112 GHz with 
published values and new Herschel Space Telescope observations to 
constrain the far-infrared properties of the source (see Supplementary 
Information for details). Our best fit gives a far-infrared luminosity of 
Ler = (6.5 + 1) X 10'*Lgun, a dust temperature of 35+ 5K (that is, 
broadly consistent with the average kinetic temperature of the molecular 
gas), a dust mass of Maust = (2.75 + 0.5) X 10°Mgyn and a star forma- 
tion rate of 850Mgy, per year (with an uncertainty of about 30%). Given 
the extent of the source this results in an galaxy-averaged star formation 
rate surface density of 850Mz,,, per year divided by (n(2.8 kpc)’) equal- 
ling 35Mgun per year kpc? (uncertainty ~50%), more than an order of 
magnitude less than found in nearby merging systems and a compact 
quasar host galaxy at z = 6.42 that has been studied in similar detail’®. 
HDF 850.1 falls on the universal local star-formation law that relates the 
average surface density of the star formation rate to that of the molecular 
gas mass per local free-fall time’’. The estimated surface density would 
increase if future observations resolved the source structure. 

The resulting [C1]/far-infrared luminosity ratio of Lyciy/Lerr = 
(1.7+0.5) X10 * in HDF850.1 is comparable to what is found in 
normal local star-forming galaxies'’, but is an order of magnitude 
higher than what is found in a z= 6.42 quasar’’, the only other 
high-z system where the [C11] emission could be resolved to date. 
Recent studies indicate that this ratio is a function of environment, 
with a low value (Lpcm/Lyip~ 1X 10-*) for luminous systems 
dominated by a central black hole (quasars) and a high ratio (up to 
Licm/Lrir ~ 1 X 10°?) for low-metallicity environments. Our rela- 
tively high ratio in Ljcu/Lrrp is consistent with HDF 850.1 being a 
high redshift star-forming system in a non-quasar environment”. 


J2000.0 system), consistent with earlier millimetre interferometric 
measurements”® at lower resolution. The [C 1] contours have been derived by 
averaging the spectrum (Fig. 1) from —400kms ‘to0kms ‘and0kms ‘to 
+400kms ' and are plotted at levels of 7 mJy per beam, 10 mJy per beam and 
13 mJy per beam (1o = 1.8 mJy per beam), respectively. In each panel the beam 
size of the [C1] observations (1.23’’ X 0.81'’) is indicated in the bottom left 
corner. From the spatial offset (total offset = 0.9’’, that is, radius r is 0.45'’ or 
2.8 kpc) and the FWHM of the line, we derive an approximate dynamical mass 
of Mayn ~ 3.4 X 10°Mgun/ (sini) where i is the (unknown) inclination of the 
system (using Maynsin’i = 1.3 X (FWHM/2)"1/G, where G is the gravitational 
constant*’). These deep Hubble Space Telescope images of the Hubble Deep 
Field fail to reveal the (rest-frame) ultraviolet/optical counterpart of the galaxy 
that is forming stars at a rate of about 850Mguy per year. 
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Figure 3 | Distribution of galaxies near HDF 850.1. a, Distribution of 
spectroscopic redshifts towards the Hubble Deep Field and its surroundings 
(from the Great Observatories Origins Deep Survey-North, GOODS-N). 
HDF 850.1 is indicated in red, and the quasar at the same redshift’* is indicated 
in blue. There is an overdensity of galaxies in the redshift bin that contains 
HDF 850.1. The high source density at z ~ 5.7 is an observational artefact due 
to narrow-band Lyman-o imaging surveys of the region (with spectroscopic 


An inspection of the distribution of galaxies towards HDF 850.1 that 
have spectroscopic redshifts shows that there is an overdensity of 
galaxies at the exact redshift of HDF 850.1, including a quasar at 
z= 5.186"* (Fig. 3 and Supplementary Information). This makes this 
region one of the most distant galaxy overdensities known to date’. An 
elliptical galaxy at z = 1.224 (ref. 20) that is situated close to HDF 850.1 
in projection (around 1"’ to the northeast) could potentially act as a 
gravitational lens for this source**”'. Using a velocity dispersion of 
146kms | in a singular isothermal sphere for this elliptical galaxy’ 
and our new redshift and position of HDF 850.1, we derive an amp- 
lification factor of around 1.4. A similar flux amplification is found for 
a simple point source lens model with mass 3.5 X 10''Mgun. This 
implies that even if lensing is occurring, the quantities derived here 
would not need to be revised significantly. 

HDF 850.1 remains outstanding in the study of dust-obscured 
starbursts at high redshift, being one of the first such sources discovered, 
and yet evading detection in the optical and near-infrared. Its redshift 
of z = 5.183 enforces the presence ofa high redshift tail (z > 4) of sub- 
millimetre bright star-forming galaxies (that is, a galaxy without an 
active galactic nucleus); currently there are only about half a dozen 
systems known’**®. Only a small fraction of submillimetre-bright 
sources is expected to be at very high redshift’’—it is thus ironic that 
the first blank-field source belongs to this subgroup. HDF 850.1’s large 
spatial extent, in combination with the modest CO excitation, 
the moderate surface density of its star-formation rate, and a high 
[C u]/far-infrared luminosity ratio, points to the presence of a spatially 
extended major starburst that is completely obscured even in the 
deepest Hubble Space Telescope images available for the Hubble 
Deep Field. The absence of a possible counterpart in the available 
deep imaging, even though the star-forming interstellar medium is dis- 
tributed over many square kiloparsecs, makes this source extreme”**. 
Given its high molecular gas mass (3.5 X («/0.8) X 10'°Mgun) and star- 
formation rate (850Msun per year), HDF 850.1 can build a significant 
stellar component as early as z ~ 4 (ref. 28; a few hundred million years 
from z~ 5). Blind line searches through spectral scans at millimetre 
wavelengths, as performed here, thus play a fundamental role in unveil- 
ing the nature of star-forming galaxies that are completely obscured in 
the (restframe) optical and ultraviolet even if multiwavelength data at 
unparalleled depth are available. 
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Possible tropical lakes on Titan from observations of 


dark terrain 
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Titan has clouds, rain and lakes—like Earth—but composed of 
methane rather than water. Unlike Earth, most of the condensable 
methane (the equivalent of 5 m depth globally averaged’) lies in the 
atmosphere. Liquid detected on the surface (about 2m deep) has 
been found by radar images only poleward of 50° latitude’, while 
dune fields pervade the tropics*. General circulation models explain 
this dichotomy, predicting that methane efficiently migrates to the 
poles from these lower latitudes*’. Here we report an analysis of 
near-infrared spectral images* of the region between 20° N and 
20°S latitude. The data reveal that the lowest fluxes in seven 
wavelength bands that probe Titan’s surface occur in an oval region 
of about 60 x 40km/’, which has been observed repeatedly since 
2004. Radiative transfer analyses demonstrate that the resulting 
spectrum is consistent with a black surface, indicative of liquid 
methane on the surface. Enduring low-latitude lakes are best 
explained as supplied by subterranean sources (within the last 
10,000 years), which may be responsible for Titan’s methane, the 
continual photochemical depletion of which furnishes Titan’s 
organic chemistry’. 

At narrow wavelength regions, or ‘windows’, centred at 0.93 jim, 
1.08 um, 1.28um, 1.58um, 2.00pm, 2.80 um and 5.0m, Titan’s 
atmosphere is sufficiently transparent to allow visibility of the sur- 
face”’®. Within 20°S and 20° N latitude, the optical depth and the 
scattering properties of the two main sources of opacity, methane 
and photochemical haze, are well characterized, both by in situ 
measurements from the Huygens probe at 10° S latitude and 191° W 
longitude’*’, and remote Cassini measurements'**. The determined 
constraints enable radiative-transfer analyses of Titan’s spectra to 
discern the effects of the surface from those of the atmosphere. We 
consider data obtained only at low phase angles (<32°), low incident 
and scattering angles (<45°), and a pixel scale less than 15 km per 
pixel, such that surface terrains are best discerned and most directly 
illuminated and viewed. Given these specifications and an exposure 
time of at least 50s, between October 2004 and December 2008, 
Cassini’s Visual and Infrared Mapping Spectrometer (VIMS) imaged 
about 17% (2.85 X 10° km?) of Titan’s 20° S-20° N surface, with the 
landing site most comprehensively sampled (Supplementary Fig. 1, 
Supplementary Information). 

The selected data comprise 32,564 spectra, which display values for 
Titan’s outgoing intensity divided by its incident flux (J/F) that span an 
order of magnitude in the clearest windows. The general trend of 
these values is well understood. At the shortest-wavelength windows, 
I/F values are higher (Fig. 1), because the atmospheric scattering of 
sunlight from the bright haze increases with decreasing wavelengths”. 
Yet the unique statistics of the dark terrains is unexpected: the lowest 
I/F values occur in the same pixels (same terrain) for all windows; that 
is, they represent the same observed spectrum. In addition, these pixels 
form an abrupt lower threshold to the range of I/F values, which is in 
contrast to the gradual falling off of the highest I/F values (Fig. 1). The 
sudden lower cut-off in the I/F and its wavelength dependence suggest 
that we have found the signature of a black surface, where the observed 
I/F is controlled solely by the scattering and absorption of sunlight 


due to the overlying atmosphere. The lowest I/F values describe a 
contiguous oval region at 14° S latitude and 173° W longitude, covering 
an area of 40 X 60 km? defined by 21 spectra, which are identical to 
within the noise (Figs 2 and 3). This ‘dark oval’ is observed on the TA, 
T10 and T35 Titan fly-bys of October 2004, January 2006, and August 
2007. It is unusual, covering an area 0.05% of that sampled. Within 7% 
of this I/F are four slightly brighter terrains. 

We derive the surface albedos of the dark oval feature using a 
discrete-ordinates radiative-transfer analysis with 32 streams (Fig. 3). 
This calculation was verified to correctly reproduce the surface albedo 
of the Huygens probe landing site, which was determined (at 0.4- 
1.6 um) from in situ measurements by the Huygens Descent Imager/ 
Spectral Radiometer (DISR). Radiative-transfer analyses of the dark 
oval spectra yield surface albedos consistent, at all windows, with a 
black surface (Fig. 3). The compounded uncertainties in the derived 
surface albedos, resulting from the VIMS calibration, the atmospheric 


Number of pixels 


0.063 


0.126 
WF 


0.189 


Figure 1 | Histograms of Titan’s I/F values at three windows. Values derive 
from the 4,096 pixels of the VIMS V1567239055 spectral image. The lowest 
I/F values include the dark oval feature (Figs 2 and 3) and represent abrupt 
lower limits to Titan’s I/F, consistent with that caused solely by scattering of the 
moon’s haze. Such an abrupt cut-off, not typical of Titan’s bright terrain 
(Supplementary Information), suggests a non-reflective surface, and calls for a 
radiative transfer analysis of the spectra. The detection of a black surface is 
particularly interesting, because Cassini radar images detect surface liquids 
through the presence of a black surface at 2.2 cm, which, at the 2% albedo level, 
requires a liquid depth of 8 m (refs 2, 17). Cassini near-infrared observations are 
more sensitive to the presence of shallow methane lakes than are Cassini radar 
observations, because methane liquid absorbs more strongly in the near- 
infrared window wavelengths, compared to at 2.2 cm. In principle, the variable 
cross-section of liquid methane in the near-infrared windows can constrain the 
depths of lakes from about 1 cm to about 2 m deep. 
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opacity, and noise, are 0.087, 0.036, 0.023, 0.020, 0.022, 0.021 and 
0.011, at the window wavelengths of 0.93 jim, 1.08 jim, 1.28 jim, 
1.58 um, 2.00 um, 2.78 {1m and 5.0 11m, respectively. Errors at the 
shortest four wavelengths are predominantly the lo error in the 
VIMS calibration; those of the three longest wavelengths are due to 
uncertainties in the atmospheric opacity (Supplementary Informa- 
tion). The 0.93-jm window does not place strong constraints on the 
surface albedo, and therefore is discussed only in the consideration of 
relative surface albedos. These values agree with the uncertainties 
estimated independently from the comparison of the VIMS and 
DISR derivations of the landing site surface albedos’* (Supplemen- 
tary Information). Our finding of a surface with an albedo smaller at 
all windows than the above stated uncertainties is consistent with the 
5-uum albedo of the south polar lake Ontario Lacus, also determined to 
be black!°, from constraints on the scattered light of the atmosphere 
determined through a comparison of I/F measurements made at dif- 
ferent emission angles. The spectra indicate no evidence for an ethane 
feature, as detected in Ontario Lacus"’: the ratio of spectra in and out of 
the dark oval resembles features caused by the ratio of spectra of any 
dark to bright terrain at the same lighting angles, and its width and 
shape are consistent with the transparency of the window. Therefore, 
our spectral ratios, while compositionally inconclusive, are consistent 
with a black surface at all windows, since spectroscopic features require 
backscattered light. 

The absorption cross-section of liquid methane varies by several 
orders of magnitude, such that only 2 m is needed to cause a black 
surface albedo (to within a value of 0.02) in all windows, but less 
methane (for example, depths of 20cm and 1 mm at wavelengths of 
1.58um and 5.0,1m respectively) blackens particular windows’. 
Liquid methane deeper than about 1 m, depending on the albedo of 
the lake bottom, readily explains the dark oval’s surface albedos. Such 
surface optical properties have not been detected on any other moon, 
and are thus inconsistent with any of the solid terrains thus repre- 
sented. Fresnel reflection from a lake contributes to I/F at short 
wavelengths, where the surface is diffusely illuminated, but only by 
at most 2% (the value of an isotropically illuminated lake), which is 
within the error of our measurements. Particulates in liquid methane 
increase the lake-surface albedo. While too little is known to predict 
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Figure 2 | Images of Titan at three wavelengths 
that probe the surface. a, c and d, I/F values at 
1.3 pm, 1.6 um and 2.0 ttm from the VIMS 

V 1567239055 spectral image of the T35 fly-by 
indicate a feature at —14° latitude and 173° W 
longitude, defined by 21 spectra. This feature also 
appears in a lower-resolution image, V1567241480 
(b), that includes the probe landing site (marked 
with an X). Data were reduced with the standard 
VIMS flats, which were checked for consistency 
with images of the haze and its north-south 
asymmetry. The I/F values in these and other 
windows are used to derive surface albedos, the 
uncertainty of which increases with decreasing 
wavelengths. This trend occurs because sensitivity 
of I/F to the surface albedo increases with 
wavelength, owing to the decreasing atmospheric 
scattering efficiency. Uncertainties in the surface 
albedos, caused mainly by the calibration of the 
VIMS instrument, the uncertainties in the 
atmospheric opacity, and the noise, are discussed in 
the Supplementary Information. 
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particle concentrations, a simple estimate of their effects suggests that 
a lake on Titan is black (to within 2%) for expected haze concen- 
trations, assuming nearly spherical particles, and for sand-sized 
particles with concentrations smaller than 10° ° m~*, which is typical 
of terrestrial lakes. Mudflats and hyper-concentrated flows, such as 
those in terrestrial desert washes following a rainstorm, are needed to 
brighten the near-infrared albedo of methane lakes significantly 
(Supplementary Information). 

Our survey identified four other terrains, two of which are defined 
by more than three pixels, that have I/F values at 1.58 im, 2.0 um, 
2.8 um and 5 pm similar to that of the dark oval, within the uncertain- 
ties (Supplementary Table 2). Yet their I/F values at 0.938 um, 
1.08 pm and 1.28 jim are, in contrast, significantly higher than those 
of the dark oval, considering the relative uncertainties in the surface 
albedos, which are estimated to be lower than the absolute albedo 
determinations (Supplementary Figs 2 and 3, and Supplemen- 
tary Information). Given the absorptive properties of liquid methane, 
the spectra of these brighter terrains are consistent with a surface 
dampened by 1-9cm of liquid, depending on the albedo of the 
underlying surface. Such a wet surface, more transparent at 2.2 cm, 
would be difficult to distinguish using radar because it attenuates the 
2.2cm reflectivity by less than 5% (ref. 17). One of these brighter 
regions (at 7° S, 185° W) is interesting because, unlike the dark oval 
region, it has been observed with radar measurements, which indicate 
that it lies in a dune field. Much of this dune field is 15% brighter than 
the dark oval region at all near-infrared wavelengths, and thus shows 
no evidence of liquid. Previous work finds that the I/F variability 
results from compositional changes in the broad inter-dune regions’*. 
Whether inter-dune regions become wet, as occurs in the Namibian 
dunes on Earth, which share traits with Titan’s dunes*’, can be tested 
with high-resolution images. That these low albedos occur in the dunes 
implies that wet terrain is associated with low-altitude regions. The 
detected five dark terrains (Supplementary Table 2) suggest that small 
lakes and swamps sparsely dot Titan’s tropical surface; the determina- 
tion of their depths and extents require additional near-infrared and 
radar measurements. 

The dark oval region has been present since 2004, several years before 
the arrival of seasonal tropical clouds*”**. Its presence during the dry 
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Figure 3 | Ten spectra of the dark oval compared to a black surface model. 
Spectra of the exceptionally dark feature at 14° S and 173° W match the 
simulated spectrum produced from a radiative transfer model that assumes at 
all wavelengths a zero surface albedo. All ten spectra (VIMS cube V1567239055, 
Fig. 3) appear for wavelengths less than 3.3 um. The 21 spectra have I/F values 
and 3c errors of 0.1365 + 0.0006, 0.1046 + 0.0006, 0.0644 + 0.0006, and 
0.0474 + 0.0009, at 1.08 um, 1.28 ttm, 1.58 um and 2.0 jim; the differences 
between the spectra precisely match the noise level of the data. The noisier 4.7- 
5.1 um I/F values are an average of the ten spectra, shown with 2c error bars 
(right inset). Synthetic spectra were calculated with radiative transfer models, 
assuming the in situ derived haze and methane parameters''”’, with two 
alterations indicated by laboratory studies**. The methane coefficients are 
adjusted slightly, by adding 0.02 (km amagat) ' to all values”*, except those 
defining the 0.93 um window, where earlier values” better characterize the 
spectra’. Also, the haze phase function determined above 80 km by DISR is 
used for the entire atmosphere'*. At wavelengths longer than 1.6 1m, 
absorption coefficients of CHy, CH3D and CO are calculated with line-by-line 
analyses of HITRAN parameters”. The 1.9-2.3 tm analysis includes pressure- 
induced absorption due to H)-N2, H2-H2, and CHy-Nz, assuming a H, 
abundance of 0.1% (ref. 14), and methane absorption”. At 2.6-3.1 um, the haze 
single-scattering albedo is set to 0.65, which reproduces the opacity structure’. 


season and its long lifetime argue against a rain puddle, which would 
evaporate quickly®. Given that methane lakes are inherently unstable on 
Titan’s tropical surface, their presence points to a subsurface source of 
liquid methane, that is, an oasis. Methane seepage is also indicated by 
the Huygens’ landing site’s stubby channels”, which opens up the 
question of whether subterranean methane played any part in the 
formation of Huygens’ heavily eroded landing site, with its damp flood 
plain, bordered by 100 m ridges. It is situated in a vast dune field. 

General circulation models demonstrate that long-lasting tropical 
lakes several metres deep must be replenished, depending on the 
ethane content’*”, within a ten-thousand-year timescale**®. Taken 
together, tropical lakes and studies of Titan’s lakes suggest that, cur- 
rently, subterranean liquid supplies methane to Titan’s surface and 
atmosphere. A supply of on average 6 X 10 *kgm *yr ‘ is needed 
to explain the composition of Titan’s atmosphere, because methane, 
the progenitor of the moon’s organic species, is destroyed in 10-100 
million years through solar ultraviolet photolysis”®. More observations 
are needed to determine whether this 4.5-billion-year-old moon is 
undergoing a specific recent flourish of geological activity, because it 
is freezing and its orbit decaying”. 
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A signature of cosmic-ray increase in AD 774-775 


from tree rings in Japan 


Fusa Miyake’, Kentaro Nagayal, Kimiaki Masuda! & Toshio Nakamura? 


Increases in '*C concentrations in tree rings could be attributed to 
cosmic-ray events!’, as have increases in '°Be and nitrate in ice 
cores*’. The record of the past 3,000 years in the IntCal09 data 
set'®, which is a time series at 5-year intervals describing the '*C 
content of trees over a period of approximately 10,000 years, shows 
three periods during which “C increased at a rate greater than 3%o 
over 10 years. Two of these periods have been measured at high 
time resolution, but neither showed increases on a timescale of 
about 1 year (refs 11 and 12). Here we report '*C measurements 
in annual rings of Japanese cedar trees from AD 750 to AD 820 (the 
remaining period), with 1- and 2-year resolution. We find a rapid 
increase of about 12%o in the '“C content from AD 774 to 775, which 
is about 20 times larger than the change attributed to ordinary solar 
modulation. When averaged over 10 years, the data are consistent 
with the decadal IntCal '*C data from North American and 
European trees'*. We argue that neither a solar flare nor a local 
supernova is likely to have been responsible. 

We used two individual Japanese cedar trees (tree A and tree B). We 
collected two series of measurements of the C content (A'‘c, see 
Fig. 1 legend) of tree A. The first consists of biennial measurements 
from AD 750 to 820. The second consists of yearly measurements from 
AD 774 to 780. The data for overlapping years match within measure- 
ment errors, confirming that the two series of measurements are repro- 
ducible. The measurements of '*C content in tree B were collected at 
1-year resolution, from AD 770 to 779. The data from tree A and tree B 


A Tree A 
O Tree B 


780 800 
Year AD 


760 820 
Figure 1 | Measured radiocarbon content and comparison with IntCal98. 
The concentration of C is expressed as A'C, which is the deviation (in %o) of 
the 'C/'*C ratio of a sample with respect to modern carbon (standard sample), 
after correcting for the age and isotopic fractionation®’. a, AC data for tree A 
(filled triangles with error bars) and tree B (open circles with error bars) for the 
period aD 750-820 with 1- or 2-year resolution. The typical precision ofa single 
measurement of AC is 2.6%o. Most data were obtained by multiple 
measurements, yielding smaller errors. Error bars, 1 s.d. b, The decadal average 
of our data (filled diamonds with error bars) compared with the IntCal98 data’ 


are consistent (reduced y? = 1.3, degrees of freedom d.f. = 10). These 
data are presented in Supplementary Information. 

Figure 1a shows the variation of '*C content of Tree-A (after the two 
series of data were combined) and Tree-B for the period AD 750-820. 
In our data, we observe an increase of !“C content of 12%o within 1 year 
(AD 774-775), followed by a decrease over several years. The signifi- 
cance of this increase (AD 774-775) with respect to the measurement 
errors is 7.20. 

In order to compare our results with IntCal98 (ref. 13), we averaged 
the yearly data to obtain a series with decadal time resolution. The 
result is shown in Fig. 1b. In the IntCal98 data, the MC content 
increased by about 7.2%o over 10 years (AD 775-785). The two series 
are consistent with each other within measurement errors. The event 
causing the increased 14C content in AD 775 could not have been local, 
because the IntCal data were obtained from North American and 
European trees, whereas we used Japanese trees. 

To have produced a large number of '*C nuclei in the atmosphere in 
AD 775, the cosmic-ray intensity must have increased considerably. 
The decadal record of another cosmogenic nuclide, 10Be can be 
obtained from the layers of ice or snow from Dome Fuji in 
Antarctica. These data include the relevant period, and exhibit a sharp 
peak in the Be flux around aD 775 (ref. 14). However, the dating of ice 
core layers is more ambiguous than that of tree rings. The age of a layer 
is determined by locating several well-known volcanic events, and 
matching the production rate pattern of '°Be with the '*C production 
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(open squares with error bars), which is a standard decadal A““C time series. Six 
standard samples (NIST SRM4990C oxalic acid, the new NBS standard) were 
measured in the same batch of samples. Because A'“C is calculated as the 
deviation of the '“C/"°C ratio of a sample with respect to an average of '*C/17C 
of the six standard samples, the errors are the resultant of error propagation. An 
error for a sample is a statistical one from a Poisson distribution, and an error 
for the standard sample is the greater of either averaged statistical error from a 
Poisson distribution of AC for the six standard samples or the s.d. of values of 
4C/°C for six standard samples. 
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record reconstructed using the IntCal data. Although we cannot say 
with certainty that the 10Be peak in the ice core occurred in AD 775, it is 
possible that the two peaks have the same cause. We take the agree- 
ments as further circumstantial evidence that the event was global. 

To model the tree ring data, we simulated the temporal variations of 
‘“C content (using a four-box carbon cycle model) after a short-term 
increase in the ‘*C production rate (based on a three-box model)”. 
Further details of this model are given in Supplementary Information. 
Using this model, we can calculate the hypothetical ‘*C production 
rates needed to explain a rapid increase in the annual time series, for 
input durations of 0.1, 0.5, 1, 2 and 3 years. The best-fit values of the 
input '“C production rate are provided in Supplementary Information. 
The model shows best agreement with tree ring data (Fig. 2) for a spike 
in “C production lasting less than 1 year. However, owing to the 
annual resolution of the '*C data, we cannot assess the duration of 
this spike in more detail. Nevertheless, as the input period increases to 
>1 year, the agreement of the model with the measured data decreases. 
Therefore, the present data are consistent with a short-term, high- 
energy event producing '“C, followed by a gradual decrease of '*C 
content due to the global carbon cycle. 

If the input period of '*C production was 1 year, the production rate 
must have been 19atomscm *s_' (see Supplementary Information) 
to explain the effects of this event. This is about 10 times larger than the 
global average production rate by galactic cosmic rays (2.05 atoms 
cm *s |; ref. 16). 

The increment of '*C content in ap 775 was about 12%. The source 
cannot be the solar cycle (that is, the Schwabe cycle), which on average 
has an 11-year period and an amplitude of 3%o with respect to its effect 
on the atmospheric '““C concentration®. An increase of 12%s in 1 year is 
about 20 times larger than expected from the Schwabe cycle. Only two 
known phenomena can change the cosmic-ray intensity within 1 year: 
a supernova explosion or a large solar proton event (SPE). 

First we consider the increase of '*C content due to a supernova 
explosion. In this case, y-rays can produce '*C because y-rays are un- 
affected by the Galactic magnetic field, unlike other charged particles 
from supernova explosions. The production mechanism is the reaction 
"*N(n,p)'4C from secondary neutrons of energy 10-40 MeV produced 
in the cascade from hard y-rays in the atmosphere. No detectable 
increase in '*C corresponding to supernovae SN 1006 and SN 1054 
was reported’, and the energy of the event in AD 775 at the Earth must be 
larger than these. We assume that the differential energy spectrum of 
y-ray emission from a supernova is described by a power law with an 
index of —2.5 (ref. 4). By integrating over y-ray energies above 10 MeV, 
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Figure 2 | Comparison of our data with a four-box carbon cycle simulation. 
Filled diamonds represent the A'C values of our data, and lines represent an 
expected change by a four-box carbon cycle simulation. Various lines represent 
different cosmic-ray input durations of 0.1, 0.5, 1, 2 and 3 years. The AC value 
of the simulation in AD 773 is fixed at a value calculated by the weighted average 
of the three data from AD 770 to 772. Error bars, as in Fig. 1 legend. 
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we obtain a ‘“C production yield of 1.2 x 10° “Catomserg *. We 


computed the production yield of '*C due to y-rays using the 
GEANT4 simulation code with QGSP-BERT-HP”, which is valid 
for thermal neutron interactions. Based on this figure, the incident 
y-ray energy necessary for this increase of '*C content in the atmo- 
sphere is about 7 X 10** erg. If the distance of the supernova were the 
same as that of SN 1006 (2 kpc; ref. 18), the total y-ray energy would be 
3 X 10°’ erg. This energy release is 100 times larger than the y-ray 
energy release from a normal supernova assuming that 1% of total 
supernova energy goes to y-rays and that emission of energy is iso- 
tropic (typical total supernova energy is of the order of 10°’ erg). 
Therefore, the supernova was closer than 2kpc, so that the total 
y-ray energy release is 3 X 10°’ erg, which is a typical supernova 
energy. However, although there are no historical records of a super- 
nova visible in the Northern Hemisphere around ap 775, there are 
historically unrecorded supernova remnants: for example, Cassiopeia 
A, which was found by radio observations, or Vela Jr (RX 
J0852.0—4622), which was found by the COMPTEL y-ray observatory, 
based on the “Ti line; the distance to Vela Jr is hundreds of parsecs and 
its age is 10°-10* years (refs 19-21). Therefore, we cannot rule out an 
undiscovered supernova remnant corresponding to the AD 775 event. 
But a supernova in AD 775 may be not probable, because a supernova 
that occurred relatively recently and relatively near Earth should still be 
tremendously bright (in radio, X-rays and **Ti), and such an object is 
not observed. 

Next we consider the case of an SPE. We assume that the flux of 
protons from an SPE as a function of rigidity (which is the momentum 
of the particle divided by the electric charge) is exponential: exp(— R/Ro), 
where R is the rigidity of protons and Rp is the characteristic rigidity of 
the SPE. Rp is set to 78 MV (ref. 5) in the following calculation. Unlike 
y-rays, protons reaching the Earth are blocked by the geomagnetic 
field. We applied predicted (using EXPACS*” software) vertical geo- 
magnetic cut-off rigidities on the Earth for an assumed geomagnetic 
field the same as the present field, and calculated the flux at intervals of 
10° in latitude, and obtained an average ‘*C production yield of 10 
“Catomserg ' using the GEANT4 code. The total proton energy 
necessary for this event was estimated to be 8 X 10° erg at the Earth, 
which corresponds to 2 X 10°” erg at the Sun and may be compared to 
the total proton energy of 10°°-10°” erg in a normal SPE”. 

Because there is a 30% increase in the decadal 'Be flux record in 
Dome Fuji from AD 755 to 785, we compared the production rate of 
'4C with that of '°Be (further discussions are presented in Supplemen- 
tary Information.) It is possible that an SPE with an extremely hard 
energy spectrum could explain simultaneously the ‘“C and '°Be results, 
but it would have to be much harder than any flare observed so far. 
Furthermore, an annual time series of !°Be flux would be necessary for 
a meticulous comparison. In fact, very large, energetic ‘super flares’ 
have been detected on normal solar-type stars. However, it is believed 
that a super flare has never occurred on our Sun, due to the absence of 
an historical record (such as a record of aurora and mass extinction 
caused by the expected destruction of the ozone layer**) and theoretical 
expectations’. 

With our present knowledge, we cannot specify the cause of this 
event. However, we can say that an extremely energetic event occurred 
around our space environment in AD 775. In the future, other high- 
resolution records (such as ‘Be and nitrate data), together with careful 
research of historical documentation around AD 775 and further surveys 
of undetected supernova remnants, may help us to clarify the cause. 
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‘Big Bang’ tomography as a new route to 
atomic-resolution electron tomography 


Dirk Van Dyck! & Fu-Rong Chen? 


Until now it has not been possible to image at atomic resolution 
using classical electron tomographic methods‘, except when the 
target is a perfectly crystalline nano-object imaged along a few zone 
axes’. The main reasons are that mechanical tilting in an electron 
microscope with sub-angstrém precision over a very large angular 
range is difficult, that many real-life objects such as dielectric layers 
in microelectronic devices impose geometrical constraints and that 
many radiation-sensitive objects such as proteins limit the total 
electron dose. Hence, there is a need for a new tomographic scheme 
that is able to deduce three-dimensional information from only 
one or a few projections. Here we present an electron tomographic 
method that can be used to determine, from only one viewing 
direction and with sub-angstrém precision, both the position of 
individual atoms in the plane of observation and their vertical 
position. The concept is based on the fact that an experimentally 
reconstructed exit wave** consists of the superposition of the 
spherical waves that have been scattered by the individual atoms 
of the object. Furthermore, the phase of a Fourier component of a 
spherical wave increases with the distance of propagation at a 
known ‘phase speed’. If we assume that an atom is a point-like 
object, the relationship between the phase and the phase speed of 
each Fourier component is linear, and the distance between the 
atom and the plane of observation can therefore be determined 
by linear fitting. This picture has similarities with Big Bang 
cosmology, in which the Universe expands from a point-like origin 
such that the distance of any galaxy from the origin is linearly 
proportional to the speed at which it moves away from the origin 
(Hubble expansion). The proof of concept of the method has been 
demonstrated experimentally for graphene with a two-layer struc- 
ture and it will work optimally for similar layered materials, such as 
boron nitride and molybdenum disulphide. 

Consider a coherent plane electron wave that interacts with a single 
atom. If we assume the atom to be a single point, it acts as a source for a 
spherical wave (Ewald sphere) that propagates to the plane of detection 
(the image plane), where it interferes with the spherical waves emitted 
by the other atoms. 

Using focal series reconstruction** or off-axis holography’, it is 
possible to reconstruct the exit wave of the object (in the future this 
might even be possible using phase plates*). The challenge is how to 
determine the three-dimensional position of every individual atom of 
the object from the exit wave. Every spherical wave can be decomposed 
in terms of Fourier components. In the Fresnel approximation for the 
spherical wave, which is valid for high-energy electrons, the phase of 
each Fourier component varies linearly with increasing distance from 
the source and is given by n/g’f, where / is the wavelength, g is the 
spatial frequency and f is the focal distance between the atom and the 
plane at which the exit wave is reconstructed. Thus, if we select the exit 
wave around the projection ofa particular atom, Fourier transform the 
wave and plot the respective phases of the Fourier components as 
function of the square of the spatial frequency, we obtain a straight 
line. This plot is analogous with the Hubble plot”* in cosmology, which 
shows that the distance and recessional speed of a distant galaxy are 


related linearly. By linear fitting of our plot, we obtain the vertical 
distance from the atom to the plane of observation (the reconstructed 
exit wave). In our analogy, this distance is the counterpart of the time 
between the Big Bang’”® and the present (Fig. 1). 

We assume that for high-energy electrons the scattering is forward 
and that a single atom is a weak-phase object’. This allows us to neglect 
multiple scattering and electron propagation inside the atom. Within 
the weak-phase object approximation, the electron wavefunction 
immediately behind the atom, that is, on the same side as the plane 
of observation, is then given by 


W(r)=1+iV,(7) (1) 


Universe 
P=0 


Present 
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Figure 1 | Big Bang analogy. a, b, Comparison between the Big Bang (a) and 
the point-atom ‘big bang’ (b). c, Phase speed plotted against phase. The 
relationship between the two is the same as that expressed in cosmology by 
Hubble’s law, which gives the linear relationship between the distance and the 
speed ofa distant galaxy. Here the slope is the reciprocal focal distance, 1/f Note 
that at the position of the atom, the phase of the atom wave does not start from 
zero; instead, it has a value, g,, characteristic of the atom. d, Phase plotted 
against phase speed, which we refer to as the Hubble plot here. The slope gives 
the focal distance between the emitting atom and the plane of reconstruction of 
the exit wave. e, Same as in d, but with a minor residual spherical aberration 
with C, = 0.3 um (see text). 
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where V,(r) is the projected electrostatic potential of the atom and r is 
the distance to the centre of the atom in the plane of projection. 

When the electron can propagate freely behind the atom, the elec- 
tron wavefunction, \,(r, z), at a distance z measured from the centre of 
the atom is given in the Fresnel approximation by convolution (@) 
with the Fresnel propagator p(r, z), which is the parabolic approxi- 
mation of the spherical wave: 


Wr, 2) =W(r)@pl(r, z)=1+ iV,(r)@p(r, z) 


Fourier transformation then yields 
alg) =F el, 2)) = 0g) +if"(g) exp (inig’f) 


= d(g) + f"\(g)exp(i(n/2+ nig’f)) 


where 6 is the Dirac delta function. The modulus of Wa(g) is the 
scattering factor, f(g), of the atom, which is the Fourier transform 
of the atom potential, V,,(r). Because V,,(r) is real and rotationally sym- 
metric, f“'(g) is also real and rotationally symmetric. Thus, it does not 
contribute to the phase of the Fourier components. Note that the factor 
of i produces an offset phase shift of m/2. The difference between @, the 
phase at the defocus distance f, and ,, the phase at defocus 0, is more 
generally formulated as 


9-9 =TAg’f (2) 
We can rewrite equation (2) as 
1 
F090) (3) 
where the factor of 1/f is equivalent to the Hubble constant, H,, in 
Hubble’s law. In the weak-phase object approximation, ~, = 7/2. 

Note also that the expression in equation (2) is rotationally sym- 
metric. This allows us to perform rotational averaging to reduce noise, 
without losing any information. Figure 1c shows a theoretical “Hubble 
plot’ from equation (3). Unlike in the astrophysics case, we do not have 
to measure the phase speed, m/g”, because we know it from theory. 
Hence, it is more appropriate for our purposes to switch the axes and to 
treat the phase speed as the independent variable (Fig. 1d). The slope in 
Fig. 1d is the distance between the plane of the reconstructed exit wave 
and the emitting atom. The projected positions of the atoms in the 
plane of projection can be obtained with picometre precision by com- 
parison of the phase maxima with an ideal lattice. 

In practice, an exit wave can be reconstructed from high-resolution 
electron microscopy images using either a weighted combination of 
images taken at different focus values (focal series reconstruction)** or 
a hologram?® obtained by interference with a reference wave. However, 
to maximize the resolution we must not eliminate the residual aberra- 
tions but instead properly balance them. Incoherent aberrations, such 
as temporal and spatial incoherence’ and isotropic vibrations of the 
atoms and of the microscope, will mainly cause an isotropic blurring of 
the amplitude of an atom wave in real space, but not of the phase, and 
therefore affects only the precision’ of the atomic position measure- 
ment. Coherent aberrations such as defocus, spherical aberration and 
astigmatism affect only the phase in Fourier space. The effect of 
residual spherical aberration, which is proportional to g*, causes a 
parabolic curvature for large values of g. It turns out that our method 
is so sensitive that by quadratic fitting we can determine the spherical 
aberration constant, C,, with a precision of more than 1 jum. Figure le 
shows a typical example of a Hubble plot with the same focal distance 
as in Fig. 1d but a spherical aberration with an aberration constant of 
C, = 0.3 um (for 80-keV electrons). As a result of this aberration, the 
plot deviates from a straight line. By fitting the curve with a quadratic, 
we can determine the residual spherical aberration constant with sub- 
micrometre precision. From the angular dependence of the atom wave 
in Fourier space, we can in principle also determine the non-symmetric 
higher-order residual aberrations, but for the moment we assume that 
these aberrations as well as the incoherent aberrations can be suffi- 
ciently corrected in the electron microscope. 


TAg? = 
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We have successfully applied our method to the study of graphene 
using both simulations and experimentally reconstructed exit waves of 
single- and double-layer graphene observed with a C,-corrected elec- 
tron microscope at 80 keV. The experimental data were obtained from 
ref. 11. The graphene wave was reconstructed from a focal series of 19 
high-resolution transmission electron microscope images''. The 
residual aberrations of the graphene exit wave were corrected up to 
third order by applying a numerical phase plate and by quantitative 
comparison with a simulated graphene exit wave’. Graphene is a very 
challenging test object for our technique because carbon atoms are 
very light (weak scatterers) but the distance between neighbouring 
atoms is very small (1.4 A), with the result that the spherical wave of 
an atom is sensitive to interference from neighbours. However, the 
theoretical distance between the graphene layers in the double sheet is 
well known, and its determination therefore provides an excellent test 
of our method. 

Figure 2 shows the phase of the exit wave of a layer of graphene that 
is partly overlapped bya second layer. The position analysis was carried 
out atom by atom, and because the theoretical positions of the atoms 
are known, we can estimate the statistical precision that can be 
obtained. There are four types of atom: those in the single layer 
(red); those in the lower layer that do not superpose with those in 
the upper layer (green); those in the two layers that superpose (brightest 
phase peak; blue); and those in the upper layer that do not superpose 
with those in the lower layer (black). 

The analysis of the exit wave is done in the following steps. (i) 
Although the original sampling of 0.00937 nm per pixel obeys the 
Nyquist criterion, such that no information is lost, we need to sub- 
sample the exit wave using spline interpolation at up to 0.00268 nm per 
pixel (Fig. 3a) to process the data further. (ii) Figure 3b shows a sub- 
sampled area of Fig. 3a. The positions of the atoms are determined by 
fitting the phase peaks with Gaussian functions. The red crosses in 
Fig. 3b are the positions of the maxima of the fitted Gaussians. (iii) In 
Fig. 3c, the atom wave is isolated with a circular mark with a radius of 
0.07 nm, which is half of the interatomic distance. To avoid the 
artefacts from the Fourier transform of a sharp circular window, we 
soften its edges. The background around the atom is estimated by 
fitting from the pixel values at the edge of the mask. (iv) A square 
patch of side length 0.14 nm around the isolated atom peak is selected. 
The modulus and phase of the isolated atom are shown colour-coded 
in Fig. 3d. (v) The background is calculated from the pixel values 
outside the mask and subtracted from the pixel values inside the mask. 
The modulus and phase of the isolated atom after background sub- 
traction is shown in Fig. 3e. The value of the background wavefunction 


9 neighbour atoms (2 atoms overlap, 
brightest atoms) 
6 neighbour atoms 


Type 1 
3 neighbour atoms 


Figure 2 | Phase of the exit wave of a two-layer graphene object. Four 
different types of atom are distinguished: type 1 (red), type 2 (blue), type 3 
(green) and type 4 (black) (see text). 
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Figure 3 | Steps in the Hubble analysis. a, Subsampling of the exit wave. 

b, Finding the peak positions of the atom wave. ¢, Isolating the atom wave using 
a soft mask. d, Modulus and phase of the isolated atom wave. e, Modulus and 
phase of the isolated atom wave after background subtraction. f, Phase contour 
map of the Fourier transform of e. g, Phase contour map of the rotationally 
averaged Fourier transform. h, Hubble plot. The phase value is extracted from 
the red line in g. 


for an ideal weak-phase object is ¢(r) = 1 as described in equation (1). 
We note that after background subtraction, the modulus of the wave 
should maintain the shape of V(r), whereas the phase should be a 
constant-phase plateau that is equal to 1/2 for a weak-phase object. 
(vi) The background-subtracted data (Fig. 3e) is Fourier transformed. 
The phase contour map of the resulting Fourier transform is displayed 
in Fig. 3f. (vii) The Fourier transform of the wave (Fig. 3f) is rotationally 
averaged. The resulting phase contour map is depicted in Fig. 3g. (viii) 
The phase value ¢ (from the centre of the rotationally averaged wave) is 
plotted against phase speed, n/g” (in A’) (Fig. 3h). This is the Hubble 
plot described by equation (2). The depth of focus, f, that is, the distance 
between the atom and the plane of the exit wave, can be determined by 
linear regression of this plot. 

By comparing these results with the ‘correct’ values given by the 
ideal lattice, using the method described above, we were able to deter- 
mine the position of every atom in the plane of observation with an 
accuracy of about 7-10 pm as compared with the correct values given 
by the ideal graphene lattice (Supplementary Information and 
Supplementary Fig. 1). We were also able to determine the vertical 
position of every individual carbon atom to a precision of about 0.2 A. 
Because every atom is measured independently, the standard deviation 
of the peak in the histogram of the defocus values, f (Fig. 4e), provides 
an internal measurement for the standard deviation of the whole 
fitting procedure. 

Figure 4a-d shows the Hubble plots for four different types of atom 
in the graphene exit wave. In total, there are 143, 111, 115 and 100 
atoms of types 1-4, respectively. The histogram of the distances of all 
the analysed atoms is given in Fig. 4e. The averaged f values for the four 
types of atom are — 0.26, —0.22, —0.18 and —3.5 A, respectively. This 
corresponds to a layer separation of 3.28 Aand shows that the graphene 
structure has a flat bottom. This separation is close to that of ideal two- 
layer graphene, which is 3.35 A (ref. 12). As shown in Fig. 4f, the flat- 
bottom model shows that the atoms of types 1 (red), 2 (blue) and 3 
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Figure 4 | Hubble plots and histogram of the focal distance. a-d, Hubble 
plots. e, Histogram of f for four different types of atom. f, The flat-bottom 
model. f,, average focal distance; o, standard deviation. g, Subtypes of atoms of 
type 4. h, Subtypes of atoms of type 1. Note that the sub-colours do not mean 
the same as in Fig. 2. 


(green) are in the bottom layer and that those of type 4 (black) are in the 
top layer. 

To determine the quantitative limits of precision, we analysed in 
detail the spread in the histograms of the vertical positions. The pre- 
cisions in vertical position for atoms of types 1-4 can be derived from 
the standard deviation of the histogram as 0.3, 0.19, 0.26 and 0.87 A, 
respectively. To analyse further the origin of the larger spread for 
atoms of type 4, we subdivided the histogram for these atoms into four 
segments, each containing the same number of atoms (Fig. 4e, blue 
dashed lines), colour-coded magenta, brown, pink or purple according 
to segment, and analysed them in real space. As shown in Fig. 4g 
(where atoms of types 1-3 are coloured yellow), atoms of type 4 with 
different focal distances are distribute randomly rather than system- 
atically. Because we have a flat-bottom structure (Fig. 4f, where the 
original atom colour coding is used), each atom of type4 has nine 
nearest-neighbour atoms (three in the top layer and six in the bottom 
layer; Fig. 2). There are, however, three near neighbours for atoms of 
type 1 and six for atoms of types 2 and 3. It is possible that the large 
spread may be due to a poor signal-to-noise ratio, which may arise 
from the influence of nearest-neighbour atoms, because atoms of 
type 4 are most highly influenced in this way. 

When analysing the histogram of the atoms of the single sheet 
(type 1), we notice a systematic difference in vertical position between 
two subtypes of these atoms. In Fig. 4e, the histogram for these atoms is 
divided at f= —0.22 A into two segments (red dashed line), with ref- 
erence to the positions of two subpeaks in the histogram. The atoms 
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were coloured grey or white if the associated focal values were less than 
or, respectively, greater than —0.22 A. As shown in Fig. 4h, the differ- 
ence between the average vertical positions of the white and grey atoms 
in the histogram suggests that the single layer may have a buckled 
structure, but we must still consider the possibility that this effect is 
due to a very small asymmetric aberration. Work on this is in progress. 

If we discard the few outlying atoms of type 4 (those for which fis 
greater than —2.2A or less than —4.8 A), the remainder have an 
average focal value of —3.5 A. The three-dimensional structure 
deduced from our Hubble analysis of the experimental graphene exit 
wave can then be formed by distinguishing the outlier atoms from the 
rest. Normal and perspective views of this three-dimensional structure 
are shown in Supplementary Fig. 2b and Supplementary Fig. 2c, 
respectively (see Supplementary Information, Supplementary Fig. 2 
and Supplementary Movie 1 for more detail). 
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Acanthodes and shark-like conditions in the last 
common ancestor of modern gnathostomes 


Samuel P. Davis", John A. Finarelli* & Michael I. Coates? 


Acanthodians, an exclusively Palaeozoic group of fish, are central to 
a renewed debate on the origin of modern gnathostomes: jawed 
vertebrates comprising Chondrichthyes (sharks, rays and ratfish) 
and Osteichthyes (bony fishes and tetrapods)’ *. Acanthodian internal 
anatomy is primarily understood from Acanthodes bronni*’-° 
because it remains the only example preserved in substantial detail, 
central to which is an ostensibly osteichthyan braincase’”’. For this 
reason, Acanthodes has become an indispensible component in 
early gnathostome phylogenies’’*”’. Here we present a new descrip- 
tion of the Acanthodes braincase, yielding new details of external 
and internal morphology, notably the regions surrounding 
and within the ear capsule and neurocranial roof. These data 
contribute to a new reconstruction that, unexpectedly, resembles 
early chondrichthyan crania. Principal coordinates analysis of a 
character-taxon matrix including these new data confirms this 
impression: Acanthodes is quantifiably closer to chondrichthyans 
than to osteichthyans. However, phylogenetic analysis places 
Acanthodes on the osteichthyan stem, as part of a well-resolved tree 
that also recovers acanthodians as stem chondrichthyans and stem 
gnathostomes. As such, perceived chondrichthyan features of the 
Acanthodes cranium represent shared primitive conditions for 
crown group gnathostomes. Moreover, this increasingly detailed 
picture of early gnathostome evolution highlights ongoing and 


Art.p.a Tt 


profound anatomical reorganization of vertebrate crania after the 
origin of jaws but before the divergence of living clades. 

Braincases are rich sources of morphological data*, and have been 
used repeatedly to build hypotheses of relationships among early 
osteichthyans, chondrichthyans and their extinct jawed relatives: 
placoderms and acanthodians'*”"*"**°, Notably, only a couple of 
acanthodian braincases have been subjected to thorough description, 
with only Acanthodes bronni’*"° preserved in multiple specimens, 
hence the importance of obtaining an accurate understanding of this 
exceptional material. Existing descriptions conform the Acanthodes 
cranium to osteichthyan’ or chondrichthyan’ models (Supplementary 
Fig. 1), of which the osteichthyan version is more widely accepted’*"*"’. 
With more early gnathostome braincases now known!) '*!>!871, 
new questions have arisen concerning apparent similarities 
between Acanthodes and its osteichthyan comparators, prompting this 
re-examination. 

Acanthodes bronni was originally collected from Early Permian 
deposits (Sakmarian—Asselian, ~290-296 million years ago”) of 
Lebach, Saar-Nahe basin (southwestern Germany)””®, making A. bronni 
among the latest-occurring acanthodian species. Conventionally, 
Acanthodes is assigned to the Acanthodidae subdivision of the 
Acanthodii on the basis of its single dorsal fin, slender branchiostegals, 
pelvic fin proximity to pectoral fins, and absence of intermediate fin 


Figure 1 | Acanthodes bronni cranial 
reconstruction. a, b, Braincase (a) and braincase, 
jaws and hyomandibula (b) reconstructed in lateral 
view. c, Braincase dorsal ossification photographed 
in lateral view. d, Dorsal ossification articulated 
with palatoquadrate and hyomandibula, 
photographed in dorsolateral view. e, Dorsal 
ossification and basisphenoid photographed in 
anterior view. Ahm, hyomandibular articulation; 
Alop, anterolateral otic process; Art.p.d, dorsal 
postorbital articulation; Art.p.v, ventral postorbital 
articulation; Bsph, basisphenoid; Btp, 
basipterygoid process; Dor, dorsal ridge; Fos, fossa; 
Hm.v, hyomandibula ventral ossification; Jc, 
jugular canal; Jg, jugular groove; Lat.com, lateral 
commissure; Lop, lateral occipital plate; Lor, lateral 
otic ridge; Mcv, foramen for middle cerebral vein 
and anterodorsal lateral line nerve; Mpt, 
metapterygoid; Oof, otico-occipital fissure; Pop, 
postorbital process; Psc, posterior semicircular 
canal; Qu, quadrate; Tfr, trigemino-facial recess; 
IX, glossopharyngeal nerve exit. All photographs of 
NMS 2001.7.1 except c, basisphenoid from 
2001.7.3. Scale bar, 10 mm. 


Art.p.v 
Lat.com 
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spines’. Specimens (Supplementary Table 1) consist of natural moulds 
preserved in clay-ironstone nodules. Here, new silicone rubber casts 
were taken from especially well-preserved specimens of similarly sized 
individuals, including National Museum of Scotland (NMS) 2001.7.1 
(cast of Humboldt University MB 3b) and NMS 2001.7.3 (cast of 
University Museum of Zoology, Cambridge GN12, formerly DMSW 
495). These new data have allowed a more comprehensive reconstruc- 
tion of the cranium than available in previous works (Fig. la, b; see 
Supplementary Information). 

Most of the new and revised details concern the dorsal ossification 
of the braincase’ (Fig. 1c-e and Supplementary Figs 2-10, 12-15), the 
completeness of which varies considerably among specimens. The 
dorsal ossification encloses most of the brain cavity, the flanking ear 
capsules, and the walls and processes separating the cavity and 
capsules from the orbits. As such, this cranial unit preserves important 
information regarding the organization of associated soft tissues, 
adding essential new information to test alternative hypotheses about 
the morphology” and relationships’ of this unique taxon and the 
larger clade or grade of acanthodian fishes. 

Differences from the standard osteichthyan model are numerous. 
Notably, the braincase roof bears a median dorsal ridge (Fig. 1c, d and 
Supplementary Figs 3a, 4) flanked by broad fossae (Fig. 1d) that 
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terminate posteriorly above the external openings of endolymphatic 
ducts (Supplementary Fig. 5). This ridge-fossae-duct complex is 
markedly similar to conditions in early chondrichthyans*’*"*”°. The 
ridge is absent in existing descriptions of Acanthodes””, and only parts 
of the endolymphatic ducts, visible on the underside of the cranial roof 
(Supplementary Fig. 3b), had been previously recognized’. 

Details of the orbit hind wall (Fig. le and Supplementary Figs 6-8) 
mostly agree with earlier reports, but the wide embayment in the 
ventral margin deserves particular attention. This has previously been 
identified as the trigeminofacial chamber’, the trigeminal nerve exit’, 
or the jugular canal roof’. We support the interpretation as the jugular 
canal roof (Figs le and 2a, b): the embayment is aligned with the 
jugular groove (Figs 1c and 2c, d), and the canal-space outer wall 
positionally matches the lateral commissure (Figs 1c, d and 2a-d) of 
other braincases”’’. The large foramen dorsolateral to this jugular 
canal has also been variably interpreted”*”°; we support the view that 
it transmitted the middle cerebral vein (Figs le and 2a, b)’”™*. 

A prominent lateral otic ridge traverses the external surface of the 
otic capsule (Figs 1c, 2c, d and Supplementary Fig. 9), much as in early 
chondrichthyans’*'**°. The jugular groove (Figs lc, 2c, d and 
Supplementary Fig. 9) passes beneath this ridge, and below the groove 
projects the anterolateral otic process (Figs 1c and 2c-f). Similar 


Figure 2 | Acanthodes braincase: details of 
braincase morphology. a, b, Left trigeminofacial 
chamber anterior wall and rear of postorbital 
process, posterior view. c, d, Left postorbital 
process, trigeminofacial recess and otic capsule, 
posterolateral view. e, f, Right otic capsule, medial, 
parasagittal view. Blue, jugular vein location; 
orange, trigeminal (V) and facial (VII) nerve 
branches; pink, semicircular canal network. Asc, 
anterior semicircular canal; Cc, crus commune; 
Esc, external semicircular canal; Pa, posterior 
ampulla; Sac, saccular recess; Ss, sinus superior; Ur, 
utricular recess; II, optic (II) nerve exit; III, 
oculomotor (III) nerve exit; IX stp, 
glossopharyngeal (IX) nerve formanen, 
supratemporal branch. See Fig. 1 for other 
abbreviations. All photographs of NMS 2001.7.1 
except basisphenoid in c and d, from 2001.7.3; 

a reversed for comparison with c. Scale bars, 5 mm. 


projections are unknown in osteichthyans or chondrichthyans, 
although some topological resemblance is shared with the placoderm 
posterior postorbital process”*”*. The lateral otic ridge and anterolat- 
eral otic process are absent in previous descriptions”. Importantly, 
there is no evidence for a glossopharyngeal nerve foramen below the 
jugular groove, in contrast to osteichthyan interpretations of 
Acanthodes*”"*, 

The previously unknown internal surface of the otic capsule (Fig. 2e, f) 
resembles early chondrichthyan and osteichthyan examples**”*”° 
(Supplementary Fig. 10). The vestibular chamber lacks a medial wall, 
revealing a vertical trough for the sinus superior and crus commune 
(Fig. 2e, f) of the inner ear canal network. However, the groove for the 
external semicircular canal (Fig. 2e, f) joins the base of the sinus 
superior dorsal to the posterior ampulla recess (Fig. 2e, f). This 
resembles chondrichthyans””’ and placoderms””; in the majority of 
early osteichthyans*’”* the external canal and posterior ampulla are 
nearly, or actually, confluent (Supplementary Fig. 10b). Consistent with 
the absence of an external glossopharyngeal nerve foramen, there is no 
evidence of an internal opening, suggesting that nerve IX exited the 
cranium through the otico-occipital fissure (Fig. 1c, d), again resembling 
early chondrichthyans'”’®’°”” (Figs 1c and 2c—f). The anterior and vent- 
ral margins of the utricular and saccular spaces (Fig. 2e, f) are uncertain 
because those parts of the otic wall are incomplete. 

The large opening anterior to the vestibular region and behind 
the postorbital process contained the trigeminofacial recess** 
(Figs 1c and 2a-d), a characteristic feature of braincases in early 
modern gnathostomes and many extant fishes. This space maps 
closely to areas housing the dorsal and ventral roots and ganglia for 
nerves V and VII in osteichthyan and chondrichthyan crania 
(Supplementary Fig. 11). In Acanthodes, the anterior wall of this recess 
includes the internal opening of the canal that probably transmitted 
the middle cerebral vein’ (Fig. 2a, b). As in early chondrichthyans”, 
the paired articulations for the upper jaw (Figs 1c, d and 2a-d) are 
positioned on the rear of the postorbital process (Figs 1c-e, 2a—d 
and Supplementary Figs 7-9, 12). These articulations lie anterior 
and dorsal to the trigeminofacial recess, not on the otic capsule, as 
in the osteichthyan model’. There is no clearly defined area of articu- 
lation for the hyomandibula (Fig. 1d). We conclude that it was situated 
posteriorly on the lateral wall of the otic capsule (Figs 1c, d, 2c, d and 
Supplementary Figs 5, 9, 12). 

Given the numerous morphological features shared with 
chondrichthyans, it might be reasoned that Acanthodes, and perhaps 
all acanthodians, are stem chondrichthyans. Principle coordinates 
analysis (PCO)*’”? of a matrix of 60 taxa and 138 morphological 
characters (Supplementary Information) provides a means to quantify 


Placoderms Acanthodians 


-0.20 


Osteichthyes Chondrichthyes 


Figure 3 | PCO of early gnathostome character data. PCO 1 (17.5% 
explained variance) is plotted on the horizontal axis and PCO 2 (13.7%; left) 
and PCO 3 (10.1%; right) on the vertical axes. The four traditionally named 
groups (placoderms in green, acanthodians in red, osteichthyans in blue, 
chondrichthyans in purple) cluster in distinct and non-overlapping regions on 
the first three PCO dimensions. The two black points represent outgroups used 
in the phylogenetic analysis: Galeaspida and Osteostraci. 


LETTER 


this similarity. Acanthodians, placoderms, osteichthyans and 
chondrichthyans form discrete clusters along the first three PCO axes 
(Fig. 3). Results show (1) a significant disparity between placoderms 
and all other gnathostomes (Supplementary Tables 2 and 3); (2) that 
acanthodians form a coherent group, with acanthodians more similar 
to one another than to other gnathostome groups (Supplementary 
Table 2); and (3) importantly, that Acanthodes in particular, and 
acanthodians in general, are more similar to chondrichthyans than 
to osteichthyans (Supplementary Tables 3-5). 

In contrast to the results of these phenetic analyses, phylogenetic 
analysis resolves Acanthodes as a stem osteichthyan (Fig. 4a and 
Supplementary Tables 6-8), with other acanthodians distributed 
among osteichthyan, chondrichthyan and gnathostome stem groups. 
This phylogenetic result corroborates and builds upon a recent large- 
scale revision of the early gnathostome tree’, delivering the most 
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Figure 4 | Results of phylogenetic analysis, and early gnathostome 
braincases preceding conditions in modern jawed vertebrates. a, Strict 
consensus of 512 shortest cladograms; bold font signifies acanthodian genera; 
black branches indicate gnathostome stem group; coloured branches indicate 
the crown clade. bg, Braincases in lateral view, anterior to right; simplified 
cladogram (grey) on left summarizes interrelationships of illustrated taxa; 
vertical bar (maroon) aligns braincases at level of pituitary vein canal. 

b-d, Placoderm-grade taxa: b, Brindabellaspis'; c, Macropetalichthys; 

d, Dicksonosteus”. e-g, Crown group gnathostomes: e, Cladodoides 
(Chondrichthyes)”; f, Acanthodes (stem Osteichthyes); g, Mimia (crown 
Osteichthyes)”* (additional and primary data sources in Supplementary Table 6). 
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detailed estimate of early gnathostome relationships so far. We 
acknowledge that tree support statistics are generally weak, and that 
much of the branching pattern in the crown clade hinges upon single 
character inclusion (Supplementary Figs 16 and 17). Moreover, 
character/taxon exclusion tests reveal conflicting signals within the 
data set and the critical influence of Acanthodes on recovered relation- 
ships (Supplementary Figs 18-20). 

The phenetic and phylogenetic results are complementary and con- 
sistent with the hypothesis that many (perhaps all) of the chondrichthyan- 
like features of Acanthodes represent symplesiomorphies of crown 
group gnathostomes: compare Fig. 4e and g with f, and contrast with 
placoderm-grade conditions (Fig. 4b-d). The increasingly detailed 
estimate of early gnathostome phylogeny allows us to distinguish 
better among primitive and derived characteristics in chondrichthyans 
and osteichthyans (Supplementary Information). Character polarities 
illuminated by Acanthodes demonstrate that ancestral gnathostome 
conditions are not averages of living bony and cartilaginous fishes, 
but rather that the morphological dichotomy of modern jawed verte- 
brates is the result of an active diversification of osteichthyans away 
from an ancestral form similar to chondrichthyans. An important 
implication of this study is that the braincase of the last common 
ancestor of modern gnathostomes was effectively shark-like, of which 
Acanthodes is a narrow-based (‘tropibasic’) variant. 

Better-resolved hypotheses of early gnathostome phylogeny are 
delivering a clearer picture of the ongoing re-organization of the ver- 
tebrate head, involving transformations of paired sensory organ posi- 
tion and size, rerouting of cranial nerves and vascular architecture, and 
major changes in the structural relationship between the braincase, 
jaws and gill arches. Such ongoing modifications started well before the 
origin of jaws’, but, importantly, they continued afterwards, as is 
increasingly evident in the morphological disparity between members 
of the placoderm-grade and early crown group gnathostomes. 


METHODS SUMMARY 


Phylogenetic analyses were performed using the NCHUCK command to con- 
strain searched tree space. Heuristic searches using the TBR branch-swapping 
option, holding one tree per replicates, were run for 25 random sequence additions 
to estimate shortest tree length. This estimate, plus one step, was used as the 
CHUCKSCORE for a further heuristic search, with 10,000 random sequence 
additions, keeping a maximum of 500 trees above the chuckscore per replicate. 
Bootstrap analyses were run for 1,000 replicates, with 100 random sequence 
additions per replicate, setting MAXTREES = 1,000 to constrain searched tree 
space. In all analyses, outgroup taxa were constrained as a paraphylum. PCO 
was performed on the Hamming distance matrix calculated from the character 
data. Further details for all analyses and all associated references are available in 
Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Phylogenetic analyses of the character matrix (60 taxa coded for 138 skeletal 
characters: Supplementary Table 7) were performed using PAUP* v.4.0b10°'. 
We report all tree lengths treating polytomies as soft. Data coverage is patchy 
(49.6% of the cells in the matrix are coded as uncertain/unknown) and the pro- 
portion of coded characters for individual taxa varies considerably (from 23.2% 
(Rhadinacanthus) to 89.9% (Mimia)). Here, we adopted tree-search strategies to 
efficiently explore tree space while simultaneously maximizing the probability of 
finding optimal islands under maximum parsimony. We used the NCHUCK 
command in PAUP* within heuristic searches (using the TBR branch-swapping 
option) to constrain the searched tree space. Heuristic searches were initially run 
for 25 random sequence additions to estimate the length of the shortest tree 
(termed TS). The value TS+1 was then used as the ‘chuckscore’ (the treescore 
at and above which a fixed number of trees were kept). A more comprehensive 
heuristic search was then run with 10,000 random sequence additions, keeping 500 
trees greater than or equal to the chuckscore (CHUCKSCORE = TS+1, 
NCHUCK = 500) for each replicate*’. 

We performed a parametric bootstrap*’ and calculated Bremer Decay indices* 
to assess support for the resolved nodes in the strict consensus of the most 
parsimonious cladograms (Supplementary Fig. 16). We performed 1,000 
bootstrap replicates using heuristic searches, with 100 random sequence additions 
per replicate. To prevent searches from becoming stuck on large tree islands, we set 
the maximum number of trees saved for each random sequence addition to 1,000 
(MAXTREES = 1,000). Adopting this strategy necessarily reduces the total 
amount of tree space searched during each random sequence addition. 
However, this compromise allows for both a larger number of random sequence 
additions per bootstrap replicate (exploring a greater breadth of tree space per 
random re-sampling of the character data), and a larger number of bootstrap 
replicates (exploring a greater number of character re-samplings). Bremer 
Decay indices were calculated using the Perl script AutoDecay (v.5)*, in conjunc- 
tion with PAUP**’. 

We performed additional analyses to test the sensitivity of our results to the data 
afforded by the new reconstruction of Acanthodes. We performed two taxon- 
subset analyses with all of the character data: (1) retaining Acanthodes, but remov- 
ing all other taxa that have traditionally been assigned to the Acanthodii 
(Supplementary Fig. 18a); and (2) deleting Acanthodes from the analysis 
(Supplementary Fig. 18b). To test the sensitivity of our results to character com- 
pleteness or character partitions, we performed a further pair of taxon-subset 
analyses. In the first, we included those taxa for which endoskeletal data are 
known, thereby including only the most completely coded taxa (Supplementary 
Fig. 19a). Second, we performed an analysis using only braincase character data 
(characters 1, 54-103), and including only those taxa for which the braincase is 
known (Supplementary Fig. 19b). In each of these supplementary phylogenetic 
analyses, we followed the tree-search strategy described above. 

For the PCO, raw data were converted to a distance matrix that was subse- 
quently decomposed” (Fig. 3). For discrete morphological character data, such as 
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in a cladistic character matrix, intertaxon distance was measured as the number of 
observed character state transitions between two taxa: the Hamming distance (Ha) 
(for example, see ref. 36). Thus, for two taxa coded for the same 50 characters, if 
there is only a single character state transition (that is, a single character coded 
differently between the two taxa), then Hy=1 (or the proportional 
Hg = 0.02 = 1/50). We normalized Hg, dividing observed character state transi- 
tions between each pair of taxa by the total number of coded characters shared 
between that pair, to obtain a proportional inter taxon difference ranging from 0 to 
1. That is, in the case of two fully coded taxa, if there were 10 observed character 
state transitions between the two, then the Hy would be 10/138 = 0.072. None of 
the taxa in this analysis are fully coded, so only those characters coded for both 
were considered for normalizing Hg. In addition, we treated each character as 
equally weighted in the PCO analysis and did not impose character state ordering, 
as neither was used in the cladistic analysis. 

The Hg matrix was subjected to statistical analyses to quantify the degree of 
similarity between the four major traditionally named gnathostome groups: 
placoderms, acanthodians, osteichthyans and chondrichthyans. We calculated 
the set of all within-group and all between-group intertaxon Hg values for each 
of the four traditionally named groups. To test hypotheses that the average Hq 
between taxa within any given named group was the same as the average distance 
across named groups, we performed t-tests” comparing within-group Hy for each 
group to its complement of three between-group sets, using Bonferroni correction 
for multiple comparisons*’. To test the hypothesis that average between-group 
distance is the same for each potential pairing of the traditionally named groups, 
we performed an analysis of variance” across these six sets of between-group Hy 
values (excluding within-group distances). Post hoc tests for the significance of 
individual pairwise between-group Hg values were performed, using the modified 
T-method to account for unequal sample sizes*’. For results of Phenetic analyses, 
see Fig. 3 and Supplementary Tables 1-4 for further discussion of methods used 
and commentary on results. 

Equipment and settings. All photographs in Figures and Supplementary 
Figures, except for those in Supplementary Figs 2, 10 and 12 were taken using a 
Leica DFC490 camera attached to a Zeiss Stemi SV6 microscope. Photographs 
were processed using Image-Pro Plus 6.2 software; in each instance a multiple 
image z-stack was created, aligned and processed using the enhanced depth of field 
option. 
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Covert skill learning in a cortical-basal ganglia circuit 


Jonathan D. Charlesworth!, Timothy L. Warren! & Michael S. Brainard! 


We learn complex skills such as speech and dance through a gradual 
process of trial and error. Cortical-basal ganglia circuits have an 
important yet unresolved function in this trial-and-error skill learn- 
ing’; influential ‘actor-critic models propose that basal ganglia 
circuits generate a variety of behaviours during training and learn 
to implement the successful behaviours in their repertoire’. Here 
we show that the anterior forebrain pathway (AFP), a cortical- 
basal ganglia circuit*, contributes to skill learning even when it does 
not contribute to such ‘exploratory variation in behavioural per- 
formance during training. Blocking the output of the AFP while 
training Bengalese finches to modify their songs prevented the 
gradual improvement that normally occurs in this complex skill 
during training. However, unblocking the output of the AFP after 
training caused an immediate transition from naive performance 
to excellent performance, indicating that the AFP covertly gained 
the ability to implement learned skill performance without con- 
tributing to skill practice. In contrast, inactivating the output 
nucleus of the AFP during training completely prevented learning, 
indicating that learning requires activity within the AFP during 
training. Our results suggest a revised model of skill learning: basal 
ganglia circuits can monitor the consequences of behavioural vari- 
ation produced by other brain regions and then direct those brain 
regions to implement more successful behaviours. The ability of the 
AFP to identify successful performances generated by other brain 
regions indicates that basal ganglia circuits receive a detailed effer- 
ence copy of premotor activity in those regions. The capacity of the 
AFP to implement successful performances that were initially pro- 
duced by other brain regions indicates precise functional connec- 
tions between basal ganglia circuits and the motor regions that 
directly control performance. 

We assessed the contributions of basal ganglia circuitry to learned 
modification of adult Bengalese finch song, a complex behaviour con- 
sisting of a sequence of 30-100-ms ‘syllables’, each with a highly stereo- 
typed acoustic structure. The song-specific motor control system 
consists of a motor pathway, which is analogous to mammalian pre- 
motor and primary motor cortex and is sufficient to produce well- 
learned elements of song, and the AFP, which is necessary for juvenile 
song learning and adult song modification*. We elicited learning by 
training birds with aversive reinforcement contingent on the fun- 
damental frequency of individually targeted syllables (Fig. 1a, b). 
Aversive reinforcement consisted of loud, 50-80-ms bursts of white 
noise**. Training with aversive reinforcement caused songbirds to modify 
fundamental frequency in a direction that adaptively reduced the like- 
lihood of white noise exposure; delivering white noise to performances 
of a syllable with fundamental frequency below a threshold caused an 
increase in mean fundamental frequency of that syllable (Fig. 1b), 
whereas delivery of white noise to performances with fundamental 
frequency above that threshold caused a decrease in mean fundamental 
frequency. These adaptive changes developed within hours and were 
specific to the fundamental frequency of the targeted syllable. 

Influential actor—critic models’, inspired by reinforcement learn- 
ing theory’ and supported by empirical evidence*”, propose that basal 
ganglia circuits such as the AFP are a crucial substrate for trial-and- 
error learning, generating a variety of behavioural performances and 


ultimately implementing only the performances that have led to suc- 
cessful outcomes. In the context of fundamental-frequency modifica- 
tion (Fig. la, b), the actor-critic model proposes that on each trial the 
AFP (the actor) generates distinct fundamental frequency values 
(exploratory behavioural variation; Fig. 1c), receives reinforcement 
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Figure 1 | Trial-and-error learning in adult birdsong. a, Spectrogram of 
song during an experiment in which white noise (WN) was delivered to 
targeted syllable (A) renditions with low fundamental frequency (FF) but not 
high FF. b, Delivering WN to syllables with low FF (shaded region) elicited 
increases in FF. Each point corresponds to one syllable rendition; the black line 
indicates the running average. c, The song circuit includes a motor pathway, 
containing HVC and RA, and the AFP, which is important for learning. The 
AFP generates variation in performance (motor exploration); red and light blue 
indicate distinct activity patterns in the AFP that lead to distinct FF values on 
different renditions of the same syllable. d, Actor—critic models propose that the 
AFP receives feedback about the behavioural variants that it generates, and this 
feedback strengthens patterns of AFP activity yielding better outcomes (light 
blue, feedback shown) and weakens patterns of AFP activity yielding worse 
outcomes (red). e, This changes the output of the AFP so that it selectively 
implements more successful behaviours. f, We tested this model by blocking the 
output of the AFP during training, thus preventing the AFP from generating 
variation in FF. g, The model predicts that this will prevent learning-related 
plasticity in the AFP, and thus there will be no change in FF, even when AFP 
output is unblocked after training. 


IW. M. Keck Center for Integrative Neuroscience, Department of Physiology, and the Neuroscience Graduate Program, University of California, San Francisco, California 94143, USA. 


14 JUNE 2012 | VOL 486 | NATURE | 251 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


signals about the consequences of that variation from dopaminergic 
neurons (the critic; Fig. 1d), and changes the probability of generating 
that fundamental frequency value in the future on the basis of its 
consequences*'*"!*, Over time, the AFP gradually adjusts its output 
to implement (that is, to cause the execution of) behaviours with better 
consequences, leading to adaptive changes in fundamental frequency 
and thus improved skill performance (Fig. le). Consistent with this 
model, blocking AFP output through lesions or reversible inactivations 
reduces song variation, indicating that the AFP generates variation in 
song performance that might serve as motor exploration*® (Fig. Ic, f). 
Moreover, blocking AFP output after learning reduces the expression 
of recently learned song changes, suggesting that the AFP can contribute 
to learning by biasing the motor pathway to implement more successful 
behaviours’** (as suggested in Fig. le). A critical yet untested proposition 
of this model is that learning requires the reinforcement of exploratory 
behavioural variation generated by the AFP; therefore, preventing the 
AFP from contributing to behavioural variation during training should 
prevent trial-and-error learning (Fig. 1f, g). 

We tested this prediction by pharmacologically blocking the output 
of the AFP, training birds with aversive reinforcement, and then 
unblocking the output of the AFP. To block contributions of the AFP 
to exploratory variation in song during training, while leaving intrinsic 
AFP circuitry intact, we exploited a pharmacological distinction 
between inputs that the songbird motor cortical nucleus RA (robust 
nucleus of the arcopallium) receives from premotor cortical nucleus 
HVC and from AFP output nucleus LMAN (lateral magnocellular 
nucleus of the anterior nidopallium). Inputs from LMAN are mediated 
almost exclusively by N-methyl-D-aspartate (NMDA) receptors 
whereas inputs from HVC are mediated by both NMDA receptors 
and o-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid (AMPA) 
receptors* (Fig. 2a). To disrupt AFP output reversibly we therefore 
inserted microdialysis probes into RA and used retrodialysis to switch 
between a control solution (artificial cerebrospinal fluid; ACSF) and a 
solution containing the NMDA receptor antagonist 2-amino-5- 
phosphonovaleric acid (APV) at 1-5 mM (Fig. 2a). Consistent with 
previous reports’*’, this manipulation affected song in the same 
manner as pharmacological inactivations or lesions of LMAN‘*"*, 
reducing the coefficient of variation (CV) of the fundamental fre- 
quency by 31.7 + 5.6% (m= 12 syllables in 9 birds) without causing 
systematic changes in song structure (Fig. 2b, cand Supplementary Fig. 
2). The APV-dependent reduction in song variation was reversible; 
switching the infusion solution back to ACSF restored the CV of 
the fundamental frequency to 96.5 + 4.6% of baseline (Fig. 2c and 
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Supplementary Fig. 2c). These data indicate that infusing APV into 
RA effectively and reversibly prevents the AFP from contributing to 
song variation (as shown schematically in Fig. Ic, f). 

As predicted by an actor-critic model of AFP function, there was no 
expression of learning while AFP output was blocked during training. 
We compared learning in control experiments (an example is shown in 
Fig. 3a) with learning in experiments with APV in RA throughout 
training (an example is shown in Fig. 3c). Training consisted of 
administering aversive reinforcement contingent on the fundamental 
frequency of a targeted syllable (Fig. 1a, b). To ensure that a similar 
proportion of syllable renditions received aversive reinforcement 
across experiments despite the reduced range of variation after APV 
infusion, we set the threshold for avoiding white noise at roughly the 
baseline median fundamental frequency for each targeted syllable (see 
Methods). To simplify presentation, we have plotted data so that the 
direction of learning (that reduces white noise exposure) is always 
upwards. For control experiments (n = 14 experiments for 9 syllables 
in 7 birds), there was significant expression of learning during the 
training period; the mean shift of fundamental frequency in the adaptive 
direction was 33.5 Hz, corresponding to a 1.1 + 0.35% change in fun- 
damental frequency (Fig. 3b, left bar; P<0.01, signed-rank test). In 
contrast, for experiments with APV in RA (n = 21 experiments for 12 
syllables in 9 birds), there was no expression of learning during the 
training period (Fig. 3d, left bar); the mean shift in fundamental fre- 
quency was 5.3 Hz (a 0.20 + 0.15% change) which was significantly less 
than in control conditions (P = 0.02, rank-sum test) and not signifi- 
cantly different from zero (P = 0.15, signed-rank test). These results 
indicate that infusing APV into RA eliminates any expression of learning 
during training and thus provide further support that this manipulation 
blocks AFP output. 

Learned changes to song appeared immediately when AFP output 
was unblocked after training. If learning required the AFP to transmit 
song variation during training, as predicted by an actor-critic model of 
AFP function, then blocking AFP output during training should have 
prevented learning and thus unblocking AFP output after training 
should not have revealed any learned changes to fundamental fre- 
quency (Fig. 1f, g). Contrary to this prediction, we observed learned 
changes to fundamental frequency after unblocking AFP output 
(Fig. 3c, d). These learned changes could not be predicted by any subtle 
changes in fundamental frequency during training (Supplementary 
Fig. 3) and were specific to the fundamental frequency of the targeted 
syllable (Fig. 3e and Supplementary Fig. 4). The average learned 
change across experiments was 27.6Hz, corresponding to a 
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Figure 2 | Infusing APV into RA reduced song variability reversibly without 
distorting song structure. a, The AFP contains the striatopallidal nucleus 
Area X, the thalamic nucleus DLM and the cortical nucleus LMAN, which 
projects to RA. We blocked AFP output to the motor pathway by infusing the 
NMDA receptor (NMDAR) antagonist APV into RA. AMPAR, AMPA 
receptor. b, Infusion of APV into RA did not markedly change the song. 
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experiments Previously reported 


c, Infusions of APV into RA reduced the coefficient of variation (CV) of FF, 
which recovered after switching back to ACSF (n = 12 syllables in 9 birds). The 
decrease in CV with APV in RA (31.7% + 5.6%) was not significantly different 
from previously reported effects of lesions (34.1 + 4.5%) and inactivations 
(28.4 + 6.0%) of LMAN in adult Bengalese finches. Error bars indicate s.e.m. 
Previously reported values are from refs 14 and 16. 
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Figure 3 | Infusing APV into RA prevents the expression but not the 
acquisition of learning. a, b, Control experiments (ACSF in RA). a, Example 
of experiment in which white noise was delivered to targeted syllables with low 
FF. Arrowheads indicate FF at end of training (1) and after training (2). The 
dashed line indicates the delay between measurements at the end of training 
and after training. b, For control experiments (m = 14 experiments in 7 birds), 
learning was expressed at a similar magnitude at the end of training (1) and 
after training (2). Learning was normalized as a percentage of baseline FF. Error 
bars indicate s.e.m. ¢, d, Experiments with APV infused into RA. c, Example of 
experiment with AFP output blocked throughout the training period. 
Arrowheads indicate FF at end of training (1) and after training and APV 
washout (2). d, For experiments with APV in RA (n = 21 experiments in 9 
birds), learning at end of training (1) was not significantly greater than zero and 
was significantly less than in control experiments. Learning after training and 
APV washout (2) was significantly greater than zero and was the same 
magnitude as in control experiments. e, After training and APV washout, 
learning was evident in syllables targeted with reinforcement (left) but not in 
other syllables of the same songs that were not targeted with reinforcement 
(right). This analysis was performed for each experiment in which FF of a non- 
targeted syllable could be reliably quantified (mn = 17 of 21 total experiments). 
f, Mean progression of learning for control experiments (left) and after 
unblocking AFP output for experiments with APV in RA (right). Points 
correspond to syllable renditions 1-5, 1-50, 51-100, ..., 451-500. Dashed lines 
indicate s.e.m. 


0.99 + 0.17% change in fundamental frequency (m = 21 experiments 
in 9 birds; Fig. 3d, right bar; P< 0.001, signed-rank test). The mag- 
nitude of learning expressed after training was statistically indistin- 
guishable from the magnitude of learning in control experiments 
(Fig. 3b, d, right bars; P>0.9, rank-sum test). In contrast to the 
gradual progression of learning in control experiments, maximal 
learning was expressed immediately after unblocking AFP output 
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and did not require further practice with AFP output unblocked 
(Fig. 3f). Thus, during training with AFP output blocked, the AFP 
had not only encoded a ‘policy’ specifying the change in song that 
would improve outcomes (for example, the fundamental frequency 
of the targeted syllable should be increased) but had already altered 
its activity to implement that change. 

The acquisition of learning during training with APV in RA is 
consistent with three classes of mechanism. First, learning could 
require activity in the AFP during training. Second, learning could 
require plasticity upstream of the AFP, possibly in the ventral tegmental 
area, and the AFP could merely serve as a conduit between the site of 
plasticity and behavioural output. Third, learning could require plasticity 
downstream of the AFP, in RA, but the expression of that learning could 
be gated by AFP output'*. To discriminate between these possible 
mechanisms we inactivated LMAN during training, by infusing 
muscimol (n = 12 experiments in 3 birds) or lidocaine (n = 2 experi- 
ments in 1 bird) into LMAN (Fig. 4a). Whereas infusing APV into RA 
blocked AFP output while leaving activity in the AFP intact, 
inactivating LMAN not only blocked AFP output but also disrupted 
activity within the AFP. 

We found that activity in LMAN during training is crucial for learn- 
ing. Inactivating LMAN reversibly reduced variation in fundamental 
frequency by the same amount as lesions of LMAN or infusion of APV 
into RA (CV decrease of 31.2 + 6.5%, n = 14; Supplementary Fig. 2b). 
Weensured in each case that the threshold for reinforcement continued 
to provide a directed instructive signal during the training period 
despite the reduced range of fundamental frequency variation (as in 
APV experiments; see Methods)*. As with infusing APV into RA, 
inactivating LMAN prevented any expression of learning during train- 
ing; expression of learning during training with LMAN inactivated was 
—0.19 + 0.37% (n = 14, P= 0.9, signed-rank test) in comparison with 
0.90 + 0.09% (n= 14, P=1.2 X 10 4, signed-rank test) in control 
experiments (Fig. 4b-d). However, in contrast to experiments with 
APV in RA, inactivation of LMAN during training prevented any 
acquisition of learning as assessed after the washout of drug 
(—0.07 + 0.21%, n = 14, P = 0.95, signed-rank test; Fig. 4c, d). These 
results demonstrate that inactivating AFP nucleus LMAN during 
training prevents the acquisition of learning, and therefore that activity 
within the AFP during training is essential for learning. 

Taken together, our results indicate that the capacity to adaptively 
modify a complex motor skill developed within the AFP during 
training with AFP output blocked. The prevention of learning by 
inactivating LMAN during training indicates that activity in the AFP 
is required for learning (Fig. 4). The immediate transition from naive 
performance to learned performance when we unblocked AFP output 
after training (Fig. 3) demonstrates that, during training, the AFP had 
gained the ability to improve behaviour even though that improve- 
ment was not yet expressed. For simpler forms of conditioning'”"’, 
such covert learning, indicating learning-related plasticity in the brain 
that is not accompanied by behavioural improvement, would only 
require that the brain region involved in learning received coarse 
signals about actions and stimuli’’. In contrast, our results indicate 
that the brain region involved in learning, the AFP, receives detailed 
information (an efference copy”) about the precise dynamics and 
timing of behavioural performance from the other brain regions con- 
trolling that performance. 

Our results motivate a revision to models of song plasticity'” '* and 
influential actor-critic models of skill learning”*, which propose that 
essential learning-related signals develop only in brain regions that are 
‘acting’ (that is, controlling behaviour). In contrast, our results indicate 
that the essential learning-related signals necessary to adaptively bias 
behaviour can develop in a basal ganglia circuit, the AFP, while it is 
prevented from contributing to behavioural performance and motor 
exploration. This indicates that motor exploration (that is, variation) 
generated by the AFP is not necessary for learning, and therefore 
a source of variation independent of the AFP can be exploited for 
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Figure 4 | Inactivating LMAN during training prevents both the expression 
and the acquisition of learning. a, We inactivated LMAN by infusing the 
GABA, antagonist muscimol (m = 12 experiments in 3 birds) or the sodium 
channel blocker lidocaine (n = 2 experiments in 1 bird) into LMAN (red 
arrow). b, c, Example experiments. b, Control experiment in which white noise 
was delivered to renditions ofa targeted syllable with low FF. c, As in b, but with 


reinforcement learning. Presumably, this variation arises in the motor 
pathway, possibly in RA’, and is transmitted to the AFP. In normal 
circumstances with AFP output intact, variation contributed by the 
AFP itself may also be used for reinforcement learning. The AFP may 
therefore be a specialized hub where information about behavioural 
variation from multiple sources converges and is associated with 
reinforcement signals to guide learning. 

The specificity of learning with AFP output blocked (Fig. 3e and 
Supplementary Fig. 4) implies that the AFP associates reinforcement 
signals with detailed information about ongoing song performance, 
including both the identity of the syllable being produced and the 
rendition-by-rendition variation in the fundamental frequency of that 
syllable. Reinforcement signals, indicating the presence or absence of 
white noise, could be conveyed to the AFP by means of known pro- 
jections from neuromodulatory nuclei such as the ventral tegmental 
area*'°. Signals encoding syllable identity are conveyed to the AFP by 
means of projections from nucleus HVC in the motor pathway to the 
striatopallidal nucleus Area X (ref. 4). In principle, auditory feedback 
could provide information about variation in fundamental frequency, 
but such auditory signals seem to be absent from the AFP during 
singing”. We therefore favour the alternative possibility, that informa- 
tion about fundamental frequency variation is transmitted to the AFP 
through an efference copy of activity in premotor regions, by way of 
projections from HVC to Area X and/or projections from RA to the 
basal ganglia-recipient thalamic nucleus DLM (dorsolateral division of 
the medial thalamus)**”* (Supplementary Fig. 1). This is consistent 
with a recent proposal that the transmission of efference copy signals 
from motor cortex (HVC and/or RA) to basal ganglia circuitry (AFP) 
has a fundamental function in mammalian skill learning”. 

Our results also indicate precise functional coordination between 
the AFP and the motor pathway. Immediately after unblocking AFP 
output, we observed learning that was specific to the reinforced 
features of song, indicating that the AFP had modified its output to 
direct the production of those specific features by the motor pathway. 
This implies not only that the AFP receives detailed information about 
the song performances produced by the motor pathway during train- 
ing, but also that it changes its output to specifically implement the 
features of those performances that were reinforced. Such a capacity of 
the AFP to precisely monitor and modify the activity of the motor 
pathway indicates fine-scale functional coordination both in the pro- 
jections from the motor pathway to the AFP and in the projections 
from the AFP back to the motor pathway. Such bi-directional coordi- 
nation might be mediated by segregated functional loops between the 
AFP and the motor pathway, each encoding a particular feature of 
song, such as high fundamental frequency in a particular syllable 
(Supplementary Fig. 1). Under normal conditions, with AFP output 
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LMAN inactivated during training. Arrowheads indicate FF at the end of 
training with LMAN inactivated (1) and after training and muscimol washout 
(2). d, Summary: for experiments with LMAN inactivated (1 and 2; n = 14), 
there was neither evidence for learning at the end of training (red) nor after 
training and drug washout (light blue). Error bars indicate s.e.m. 


intact, such functional loops could enable the AFP to amplify and bias 
specific behavioural features, functions that have been attributed to 
mammalian basal ganglia circuits*’’*. More generally, our results sug- 
gest that precise functional coordination between motor cortex and 
basal ganglia circuitry is important for enabling motor skill learning. 


METHODS SUMMARY 


All experiments were performed on adult (more than 120 days old) male Bengalese 
finches (Lonchura striata domestica) singing undirected song. Song recording and 
feedback delivery were performed with software’ that recognized a targeted 
syllable and delivered a 50-80-ms burst of white noise unless the fundamental 
frequency (FF) met an escape criterion. For experiments with APV in RA and 
associated controls, the threshold for escaping white noise was set near the median 
FF of the targeted syllable; thus, about 50% of syllable performances initially 
avoided white noise. We used reverse microdialysis“ to deliver the NMDA-receptor 
antagonist DL-APV (1-5 mM in ACSF) to RA, and the GABA, agonist muscimol 
(100-500 LLM) or the sodium channel blocker lidocaine (2%) to LMAN. To ensure 
the complete wash-in of drug, we delayed 1-2h between drug infusion and the 
beginning of the training period. Immediately after training, the solution was 
switched back to ACSF. To ensure complete the wash-out of drug, we delayed at 
least 1 h between switching the solution to ACSF and measuring FF performance 
after training. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Animal care. All experiments were performed on adult (more than 120 days old) 
male Bengalese finches (Lonchura striata domestica) that had been bred in our 
colony and housed with their parents until at least 60 days of age. During experi- 
ments, birds were housed individually in sound-attenuating chambers (Acoustic 
Systems) with food and water provided ad libitum. All song recordings were from 
undirected song (that is, no female was present). All procedures were performed in 
accordance with established protocols approved by the University of California, 
San Francisco Institutional Animal Care and Use Committee. 

Training. The same training parameters were used for control experiments and 
experiments with pharmacological manipulations. Song acquisition and feedback 
delivery were accomplished using previously described LabView software 
(EvTaf?), which recognized a specific time (contingency time) in a targeted syllable 
of song based on its spectral profile. On recognition, EvTaf recorded the time and 
calculated the fundamental frequency (FF) during the previous 8 ms of song. If the 
FF met the escape criterion (that is, above or below a threshold), no disruptive 
feedback was delivered. Otherwise, a 50-80-ms burst of white noise was delivered 
starting less than 1 ms after the contingency time. The duration of white noise was 
constant for a given experiment. To allow quantification of FF during training, a 
randomly interleaved 10% of songs were allocated as catch trials and did not 
receive white noise. 

Experiments with reversible disruption of LMAN transmission to RA by 
reverse microdialysis. We interfered with LMAN transmission to RA by using 
a previously described reverse microdialysis technique", in which solution diffuses 
into targeted brain areas across the dialysis membranes of implanted probes. RA 
was mapped electrophysiologically during cannula implantation so as to direct 
probes to the centre of RA. Between probe insertion and white noise training, there 
was amore than 48h period in which control solution (ACSF) was dialysed at a 
flow rate of 1plmin”'. The dialysis solution was switched from ACSF to the 
NMDA-receptor antagonist DL-APV (2-5mM in ACSF; Ascent) at least 1.5h 
before the onset of white noise training so that the threshold for escaping white 
noise could be determined on the basis of song performance with APV in RA. 
During this period we evaluated the efficacy of APV by assessing the rendition-to- 
rendition variability of FF for individual syllables. FF variability reduced and 
stabilized at an asymptotic level within the first 30 min of APV dialysis, indicating 
rapid onset and equilibrium of drug effect. We observed a reduction in variability 
similar to that reported after lesions or inactivations of LMAN"*"*. For clarity of 
presentation in Fig. 3, running averages of FF performance for experiments with 
APV in RA omit the period during APV wash-in before the onset of white noise. 
For experiments with APV in RA and the accompanying control experiments, 
white noise was delivered for 4-14 h while birds were awake. Blocking AFP output 
reduced variation in FF by an average of 31.7%, meaning that setting the threshold 
for avoiding white noise at a certain level above mean FF (for example +30 Hz) in 
control experiments and experiments with AFP output blocked would result in a 
greater proportion of syllable performances escaping aversive reinforcement in 
control experiments. To avoid this confound and ensure that a similar proportion 
of syllable renditions received aversive reinforcement in control experiments and 
experiments with AFP output blocked, we set the threshold for avoiding white 
noise at approximately the baseline median FF performance (between the 40th and 
60th centiles in all experiments). To ensure that our assessment of learning during 
the training period evaluated the effects of white noise training as opposed to the 
acute effects of APV, FF change at the end of the training period was quantified by 
subtracting FF immediately before training (during the period with APV in RA 
before the onset of white noise) from FF at the end of the training period. 
Immediately after the conclusion of white noise training, the dialysis solution 
was switched back to ACSF. Learning after the training period was quantified 
by measuring the difference between FF performance after white noise training 
(with ACSF in RA) and FF performance before white noise training and before 
infusing APV into RA (that is, with ACSF in RA). Although the latency between 
switching the solution remotely at the pumping apparatus and changing the 
solution at the probe tips was only 6 min in our experimental setup'’, the APV- 
dependent decrease in FF variability typically remained for hours after switching 
back to ACSF, presumably reflecting the combined kinetics of passive diffusion, 
active clearance and degradation mechanisms. In all experiments, birds were 
prevented from singing for at least 1.5h after being switched from APV to 
ACSF to provide time for APV washout. For quantification of learning expressed 
immediately after training (Fig. 3f), we analysed the first songs performed after this 


period. To further ensure that persisting effects of APV would not cause an 
underestimation of learning in our primary representations of the data (Fig. 3d, e), 
expression of learning was assessed the morning after the training period. This 
allowed sufficient time for the APV-dependent block of AFP output to subside 
while providing limited opportunity for the birds to sing in the absence of white 
noise; this was important because, in control conditions, singing in the absence of 
white noise results in a gradual loss of learned changes to fundamental frequency” 
(that is, extinction). In a subset of experiments (8 of 24), white noise training was 
terminated (and APV was switched to ACSF) at least 3h before sleep. In these 
experiments we found that the expression of learning before sleep was significantly 
greater than zero (0.95 + 0.25% change in FF, P < 0.02, signed-rank test) and only 
slightly less than learning the next morning (1.3% + 0.18% change in FF). This 
indicates that washout of APV, independently of a period of sleep, was sufficient to 
enable the expression of learning. Probe position in RA was established by using 
electrophysiological mapping of RA during implantation and confirmed post 
mortem by identifying cannula tracts in brain sections stained for Nissl bodies. 
Additionally, in three birds, biotinylated muscimol (diluted to 500 1M; EZ-link 
biotin kit; Pierce) was dialysed across the diffusion membrane to estimate the path 
of diffusion from the membrane". In these birds, probe position was determined 
post mortem by histological staining for biotin and by comparing interleaved 
sections stained for Nissl bodies. Spread of drug outside RA tended to be in regions 
dorsal to RA, along the cannula, but not into the lateral areas where nucleus Ad is 
located. 

Experiments with reversible inactivation of LMAN by reverse microdialysis. 
We examined the progression of learning for data from experiments in which we 
transiently inactivated LMAN by using the same reverse dialysis technique that we 
used for infusing APV into RA™. To inactivate LMAN, we switched the dialysis 
solution from ACSF to the GABA, agonist muscimol (100-500 uM; Sigma; 3 
birds, 12 experiments) or the Na* channel blocker lidocaine (2%; Hospira; 1 bird, 
2 experiments) at a flow rate of 1 pl min’ '. Inactivations lasted for 3-4h, during 
whicha 1 pl min~ ! flow rate was maintained. At the conclusion of inactivation, the 
dialysing solution was switched back to ACSF. We applied white noise contingent 
on FF over a total period of 2 days or more, during both control and LMAN 
inactivation periods. The threshold for escaping white noise was raised incrementally 
to drive progressive changes in FF. In each experiment, FF eventually reached a 
stable value because we stopped raising the threshold. We only considered LMAN 
inactivations on days before FF reached this stable value, to ensure that the bird 
retained the capacity for further learning. For each LMAN inactivation, learning 
after training was quantified as the difference in FF between the last 50 renditions 
of the syllable before infusion of drug and the first 50 renditions of the syllable after 
drug washout, normalized as for experiments with APV in RA. We excluded the 
first 1h after switching the infusion solution to ACSF to permit washout. During 
the period with LMAN inactivated, which lasted a minimum of 3 h, the threshold 
for escaping white noise was set so that more than 10% but less than 50% of 
syllables escaped and thus a learning signal of differential reinforcement was 
present in each experiment. This is crucial for interpretation of the lack of learning 
in these experiments, because learning in this model does not proceed without such 
differential reinforcement’. Learning during training with LMAN inactivated was 
quantified with a linear regression of FF on the renditions of the targeted syllable 
during training with LMAN inactivated. For each inactivation, matched learning 
in control conditions was quantified by calculating the average hourly rate of 
change in FF during ACSF infusion on the day of that inactivation and multiplying 
that rate by the number of hours for which LMAN was inactivated. Probe posi- 
tioning and the path of drug diffusion were evaluated post mortem by histological 
staining of sectioned tissue as described previously’. Tissue damage caused by 
cannulae enabled confirmation that probes were accurately targeted to LMAN. In 
addition, biotinylated muscimol or ibotenic acid was used to estimate the spread of 
diffusion, as described previously". 

Analysis. All analyses were performed with custom software written in MATLAB 
(Mathworks). Fora given syllable, FF was measured over a consistent time window 
aligned to syllable onset; for syllables targeted with white noise feedback, the 
measurement time window was centred on the median time at which feedback 
was delivered. FF was calculated as described previously’ for both targeted syllables 
and non-targeted syllables of the same song. Spectral entropy, volume and dura- 
tion were calculated as described previously’. Statistical significance was tested 
with non-parametric statistical tests; Wilcoxon signed-rank tests and Wilcoxon 
rank-sum tests were used where appropriate. 
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Autism spectrum disorders comprise a range of neurodevelopmental 
disorders characterized by deficits in social interaction and commun- 
ication, and by repetitive behaviour’. Mutations in synaptic proteins 
such as neuroligins”’, neurexins*, GKAPs/SAPAPs’ and ProSAPs/ 
Shanks*”° were identified in patients with autism spectrum disorder, 
but the causative mechanisms remain largely unknown. ProSAPs/ 
Shanks build large homo- and heteromeric protein complexes at 
excitatory synapses and organize the complex protein machinery of 
the postsynaptic density in a laminar fashion’’’*. Here we dem- 
onstrate that genetic deletion of ProSAP1/Shank2 results in an early, 
brain-region-specific upregulation of ionotropic glutamate receptors 
at the synapse and increased levels of ProSAP2/Shank3. Moreover, 
ProSAP1/Shank2~‘~ mutants exhibit fewer dendritic spines and 
show reduced basal synaptic transmission, a reduced frequency of 
miniature excitatory postsynaptic currents and enhanced N-methyl- 
D-aspartate receptor-mediated excitatory currents at the physio- 
logical level. Mutants are extremely hyperactive and display profound 
autistic-like behavioural alterations including repetitive grooming 
as well as abnormalities in vocal and social behaviours. By com- 
paring the data on ProSAP1/Shank2-‘~ mutants with ProSAP2/ 
Shank3af '~ mice, we show that different abnormalities in synaptic 
glutamate receptor expression can cause alterations in social inter- 
actions and communication. Accordingly, we propose that appro- 
priate therapies for autism spectrum disorders are to be carefully 
matched to the underlying synaptopathic phenotype. 

Many of the recently identified autism spectrum disorders (ASD) 
candidate genes code for proteins of excitatory synapses'*"*, suggesting 
that these disorders may arise from molecular imbalances of synaptic 
connections. In this context, targeted disruption of the ProSAP2/Shank3 
gene in mice resulted in molecular perturbations of glutamatergic 
synapses and profound autistic-like behaviour'®”’. Here we generated 
mice lacking all isoforms of ProSAP1/Shank2 (Fig. 1A and Supplemen- 
tary Fig. la-g) to decipher the interrelation between ProSAP1/Shank2 
protein levels, synaptic architecture, neurophysiology and behaviour in 
mice. 

Heterozygous ProSAP1/Shank2*’~ (expressing approx. 50% of 
ProSAP1/Shank2 protein, Fig. 1A) and homozygous ProSAP1/ 
Shank2~'~ mutants were viable, but their survival rate was lower com- 
pared with wild-type littermates (Supplementary Fig. 2a). Although 
body weight was reduced (Supplementary Fig. 2b-e), adult mutants 
displayed normal appearance and overall brain morphology (data not 


shown). However, hindlimb clasping was observed (Supplementary 
Fig. 2f), similarly to some other mouse models of ASD”°”’. 

Owing to high expression of ProSAP1/Shank2 in the hippocampus 
during spinogenesis” and as patient-based mutations in ProSAP1/ 
Shank2 were recently shown to alter dendritic spines in the hippocam- 
pus”, we assessed spine density and synaptic ultrastructure in the CA1 
region. We found a small reduction of spine numbers in ProSAP1/ 
Shank2~'~ mutants (Fig. 1B, a) whereas postsynaptic density (PSD) 
length or thickness was not significantly altered (Fig. 1B, b). 
Biochemical analysis revealed higher levels of the N-methyl-p- 
aspartate receptor (NMDAR) subunit GluN1 and ProSAP2/Shank3 
in whole brain PSDs of ProSAP1/Shank2-‘~ mice (Supplementary 
Fig. 3a). Interestingly, ProSAP2/Shank3 upregulation specifically 
occurred at synapses, as protein and messenger RNA (mRNA) levels 
were not changed significantly in whole brain of mutant versus wild- 
type animals (Supplementary Fig. 3b, c). Further evidence for local 
compensation was apparent by subfractionation experiments and 
transient knockdown of ProSAP1/Shank2 in primary hippocampal 
cultures, resulting in a rapid increase of GluN1 and ProSAP2/ 
Shank3 at synaptic sites (Supplementary Fig. 3b, d). 

Based on the ProSAP1/Shank2 expression profile in wild-type 
mouse brain (Supplementary Fig. 4), we biochemically isolated crude 
synaptosomal fractions from cortex, hippocampus and striatum of 
wild-type, ProSAP1/Shank2*’~ and ProSAP1/Shank2~'~ mice at 
postnatal day (P)25 and P70 to examine molecular alterations with 
respect to brain regions and development. The major change com- 
pared with wild types was an early increase of NMDAR subunits in the 
hippocampus and striatum of ProSAP1/Shank2 ‘~ mutants. Notably, 
this increase was sensitive to ProSAP1/Shank2 gene dosage as it was 
also observed in ProSAP1/Shank2*’~ mice, but to a lesser extent. At 
P70 the upregulation of ProSAP2/Shank3 was observed in all brain 
regions investigated (Fig. 1C, D and Supplementary Figs 5 and 7; cortex 
data not shown). We compared these observations with molecular 
synaptic changes when major isoforms of ProSAP2/Shank3 are not 
present and analysed ProSAP2/Shank3xp'~ mice (similar to recently 
published Shank3 mutants’’, Supplementary Fig. 6). Especially in the 
striatum, we observed a clear difference between ProSAP1/Shank2 /— 
and ProSAP2/Shank3«f ‘~ mice with respect to their synaptic content 
of ionotropic glutamate receptors. The levels of most subunits were 
higher in ProSAP1/Shank2'~ and lower in ProSAP2/Shank3ap /~ 
mice. Interestingly, apart from the increase of ProSAP2/Shank3 in 
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Figure 1 Cyto-architechtural and molecular changes in ProSAP1/ type and ProSAP1/Shank2~‘~ animals (right panel). Data are presented as 
Shank2™'~ mouse brain. A, Western blot of pooled (nm = 10) whole brains (left | cumulative frequency plots, small insets depict median values compared 


panel) and cerebella (upper right panel) from wild-type (+/+) and ProSAP1/ 
Shank2-'~ (—/—) mice as indicated. ProSAP1/Shank2 isoforms are marked by 
arrowheads: ProSAP1/Shank2 (black), ProSAP1A/Shank2A (white), 
ProSAP1E/Shank2E (grey). Forebrain wild-type homogenate was used as 
control to differentiate cerebellar isoforms. Total ProSAP1/Shank2 mRNA 
(middle panel) and protein levels (right lower panel) from wild-type, ProSAP1/ 
Shank*’~ (+/—) and ProSAP1/Shank2 ‘~ hippocampi. B, a, Representative 
images of secondary dendrites from CA1 hippocampal neurons of adult wild- 
type and ProSAP1/Shank2"'~ mice (Golgi-Cox staining, scale bar: 1 4M) and 
quantification of spine density from n = 6 wild-type (white bar) and ProSAP1/ 
Shank2~'— (black bar) littermate pairs. B, b, Representative electron 
microscopy images of CA1 synapses from wild-type and ProSAP1/Shank2 '~ 
animals. Synaptic vesicles (arrowheads), PSDs (arrows) and dendritic spines 
(asterisks). Scale bar: 100 nm. Analysis of PSD length and thickness from wild- 


ProSAP1/Shank2~'~ mice, we detected a vice versa upregulation of 
ProSAP1/Shank2 in ProSAP2/Shank3uf~'~ mice (Fig. 1C, D and 
Supplementary Figs 5-7). This phenomenon was not due to an 
increase of transcript levels and was observed in whole brain PSDs 
from both animal models (Supplementary Fig. 8a-c). 

To analyse how the altered molecular composition of ProSAP1/ 
Shank2~'~ synapses influences synaptic transmission, we performed 
extracellular field and whole-cell patch clamp recordings from CA1 
pyramidal cells in acute hippocampal slices. Field excitatory 
postsynaptic potentials (fEPSPs) were decreased by approximately 

40% in ProSAP1/Shank2 ' (Fig. 2A) as well as ProSAP1/Shank2‘/~ 
animals (Supplementary Fig. 9a). The reduced synaptic transmission 
was not only found in young mice (P21-P28), but also in older animals 
(3 months of age, Fig. 2A, c, investigated in ProSAP1/Shank2 ‘~ only). 
There was no evidence for genotypic differences in the excitability of 
presynaptic fibres, the intrinsic firing threshold and the whole-cell input 
resistance of CAl pyramidal cells (Supplementary Fig. 9b-d). When 


between wild-type (white bars) and ProSAP1/Shank2 ‘~ (black bars) animals. 
n = 220 PSDs for six wild types and n = 215 PSDs for six ProSAP1/Shank2 ‘~ 
mice. C, Semi-quantitative analysis of proteins in crude synaptosomal fractions 
from different brain regions of wild-type, ProSAP1/Shank2 ‘~ and ProSAP2/ 
Shank3aB'~ mice as indicated. Mutant protein was normalized to wild-type 
levels and is plotted as relative change of expression levels. D, Colour-coded 
visualization of protein levels (ProSAP1/Shank2, ProSAP2/Shank3, NMDAR, 
AMPAR) in ProSAP1/Shank2 '~ or ProSAP2/Shank3ap '~ brains (cortex, 
hippocampus, striatum) at the indicated time points (P25, P70). C, D, Red bars/ 
colour indicate elevated, blue bars/colour decreased, protein levels. A-D, +/+, 
wild types; +/—, ProSAP1/Shank2*’~; —/—, ProSAP1/Shank2~'~. All data are 
presented as mean = s.e.m.; all P values are derived from unpaired, two-tailed 
Student’s t-tests (*P < 0.05, **P< 0.01, ***P< 0.001). 


analysing miniature excitatory postsynaptic currents (mEPSCs) from 
CAI pyramidal cells (Fig. 2B), ProSAP1/Shank2~‘~ mice showed a 
significant reduction in mEPSC frequency. There was no evidence for 
differences in mEPSC amplitudes and «-amino-3-hydroxy-5-methyl- 
4-isoxazole propionic acid (AMPA)-mediated whole-cell currents 
(reflecting the total number of synaptic plus extrasynaptic AMPA 
receptors) (Fig. 2C). To probe for possible changes in NMDAR- 
mediated excitatory synaptic transmission, we compared the relative 
contribution of NMDA versus AMPA receptors to evoked EPSCs. In 
agreement with the upregulation of NMDAR subunits in hippocampi 
of ProSAP1/Shank2 "~ mice (see Fig. 1C, D), we found an approxi- 
mately 30% increased NMDA/AMPA ratio in mutants versus wild types 
(Fig. 2D). We further analysed synaptic plasticity. NMDAR-dependent 
long-term potentiation induced by high-frequency stimulation of the 
Schaffer collaterals was slightly enhanced in ProSAP1/Shank2~‘~ mice 
(Fig. 2E). We found no evidence for alterations in long-term depression 
between genotypes (Supplementary Fig. 9e). As imbalanced excitation/ 
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Figure 2 | Imbalanced hippocampal glutamatergic synaptic transmission in 
ProSAP1/Shank2 ‘mice. A, Input-output curves for basal synaptic 
transmission. As illustrated in the sample traces (A, a, averages of six fEPSPs) 
and in the quantification (A, b), ProSAP1/Shank2-‘~ (—/ —) mice suffer from 
reduced synaptic transmission compared with wild-type controls (+/+) at the 
age of P21—P28 (two-way analysis of variance (ANOVA): P< 0.05; +/+: 
number of experiments (m) and number of animals (N) = 8 (3); —/—: n= 11 
(4)). This defect is also found in mice that are 3 months of age (A, c) (two-way 
ANOVA: P< 0.05; +/+: 1 (N) =7 (3); —/—: n (N) =7 (3)). B, MEPSCs in 
CA1 pyramidal cells. B, a, Sample traces of individual recordings (left) and an 
average of all mEPSC events (right). B, b, The frequency of mEPSCs is reduced 
in ProSAP1/Shank2‘~ (Student’s t-test: P< 0.05; +/+:n(N) = 12(4);—/-:n 
(N) = 16 (5)). B, c, Cumulative fraction distribution of mEPSC amplitudes 
(two-sample Kolmogorov-Smirnoyv test: P = 0.96; +/+: 504 events, n (N) = 12 


inhibition ratios have been repeatedly implicated in models of autism’, 
we also analysed GABAergic (y-aminobutyric acid-mediated) synaptic 
transmission. Frequency and amplitude of inhibitory postsynaptic 
currents (both miniature and spontaneous) were largely unchanged 
in ProSAP1/Shank2'‘~ mice (Supplementary Fig. 10a, b). Based on the 
electrophysiological analyses, we conclude that merely glutamatergic 
transmission is impaired in ProSAP1/Shank2 mutants. 
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Figure 3 | Increased locomotor activity and stereotypical behaviours in 
ProSAP1/Shank2~'~ mice. a, Examples of trajectories of a wild-type mouse 
and a ProSAP1/Shank2 ‘~ mouse in 30 min exploration of the open field. 

b, Distance travelled by male and female mice during 30 min free exploration of 
a circular maze. c, Mean digging bout duration in male and female mice. 

d, Mean self-grooming bout duration in male and female mice. Data are 
presented as mean + s.e.m. (Mann-Whitney U-tests: *P < 0.05; **P < 0.01; 
***P < 0,001). Unless otherwise specified, (n+), = 16,4); = 16,n_)_ = 16) 
males and (n+/4 = 16, n4;_ = 16, n_;_ = 13) females were tested. 
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(4); —/—: 630 events, n = 15 (5)). Inset: mean mEPSC amplitudes (Student’s 
t-test: P = 0.37; sample sizes as above). C, Whole-cell currents evoked by bath 
application of 20nM AMPA (Student’s f-test: P = 0.9; +/+: n (N) = 8 (3); 
—/—:n (N) = 9 (3)). D, NMDA/AMPA ratios estimated from compound 
EPSCs evoked at +40 and —60 mV, respectively. As illustrated in the sample 
traces (D, a) and the quantification (D, b), the ratio of synaptic NMDA versus 
AMPA receptors is significantly increased in ProSAP1/Shank2~‘~ animals 
(Student’s t-test: P< 0.05; +/+: n (N) = 18 (8); —/—:n (N) = 19 (6)). E, Long- 
term potentiation is increased in ProSAP1/Shank2 ‘~ mice, as evident from the 
sample traces (E, a), the average time plot (E, b) (two-way ANOVA: P< 0.01; 

+/+: n (N) = 30(5); —/—: n = 34 (6)) and the ratio of fEPSP slopes 30 min 
after versus before induction of long-term potentiation (E, c) (Student’s t-test: 
P<0.001; sample sizes as above). All data are presented as mean = s.e.m. 
*P< 0.05, **P< 0.01, ***P< 0.001. 


Despite these synaptic abnormalities, ProSAP1/Shank2 ‘~ mice 
displayed functional working memory, motor coordination, olfaction 
and object recognition (Supplementary Fig. lla-g). The most 
remarkable behavioural phenotype was hyperactivity. When compared 
with wild-type littermates, male and female ProSAP1/Shank2-'~ mice 
displayed twice the level of locomotor activity in the open field (Fig. 3a, 
b) and in other tests (Supplementary Fig. 12a-e). ProSAP1/Shank2 ‘~ 
males displayed an increased level of anxiety during the light-dark box 
test (Supplementary Fig. 12f, g). Compared with wild types, digging 
bouts of ProSAP1/Shank2'~ mice were significantly shorter (Fig. 3c), 
and self-grooming in ProSAP1/Shank2-‘~ females was significantly 
extended (Fig. 3d). These stereotyped behaviours, however, were less 
severe compared with other mouse models of ASD such as ProSAP2/ 
Shank3ap '~ mutants!””° or BTBR T+tf/J mice”. 

We next examined social behaviour. During free same-sex social 
interactions (resident-intruder), the latency for the first contact did 
not differ significantly (Fig. 4a), but both male and female ProSAP1/ 
Shank2~'~ mice had difficulties maintaining social contacts or were 
less interested in them (Fig. 4b). During free interactions of a tested 
male mouse with an oestrus C57BL/6 female mouse, the latency for the 
first contact was significantly longer for ProSAP1/Shank2~'~ males 
than wild-type males (Fig. 4a), but no impairment in contact main- 
tenance was detected (Fig. 4b). During the three-chamber test, both 
male and female mutants displayed a reduction in conspecific recog- 
nition or in their interest for social novelty compared with wild types 
(Supplementary Fig. 13a-d). 
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Figure 4] Abnormalities in social and vocal behaviour of ProSAP1/ 
Shank2 " mice in the resident-intruder test and during the interaction ofa 
male with an oestrus female. a, Latency for the first contact in same-sex free 
interactions (C57BL/6 resident-intruder) when the resident mouse was 
isolated from weaning on (males) or for 3 days before the experiment (females), 
and in the interaction of a male with a C57BL/6 oestrus female. b, Time spent in 
contact during same-sex free interactions when the resident mouse was isolated 
from weaning on (males) or for 3 days before the experiment (females), and in 
the interaction of a male with an oestrus female (n). = 15, n+);_ = 16, 

n_)— = 16). ¢, Latency for the first ultrasonic vocalization in same-sex free 
interactions when the resident mouse was isolated from weaning on (males) or 
for 3 days before the experiment (females), and in the interaction of a male with 
an oestrus female. d, Rate of calling during same-sex free interactions when the 
resident mouse was isolated from weaning on (males) or for 3 days before the 
experiment (females), and in the interaction of a male with an oestrus female. 
Data are presented as mean + s.e.m. (Mann-Whitney U-tests: *P < 0.05; 
**P < 0.01; ***P < 0.001). Unless otherwise specified, (n/4 = 16,+,_ = 16, 
n_);— = 16) males and (n/4 = 16, n4;— = 16, n_;— = 13) females were tested. 


During free social interactions or when isolated as pups, mice emit 
ultrasonic vocalizations. In pups, ProSAP1/Shank2 '~ females, but 
not males, called at a significantly higher rate than wild-type females 
at P4 and P10 (Supplementary Fig. 14a, b). In adults, during male- 
male social interactions, few calls were recorded and there was no 
significant difference between genotypes (Fig. 4c, d). In contrast, during 
female-female interactions, we observed a significantly longer latency 
to emit the first vocalization (Fig. 4c) and significantly fewer vocaliza- 
tions (Fig. 4d) for pairs involving a ProSAP1/Shank2 ‘~ mouse com- 
pared with pairs involving wild-type females. Notably, in pairs 
involving ProSAP1/Shank2-‘~ mutants, mice uttered more short 
and unstructured calls and fewer mixed calls than pairs involving wild 
types (Supplementary Fig. 14c, e). In the socio-sexual context of a male 
in the presence of an oestrus female, the latency for the first ultrasonic 
vocalization was significantly longer in pairs involving ProSAP1/ 
Shank2~‘~ males than pairs involving wild types (Fig. 4c). Similarly 
to females, more short and unstructured calls were emitted (Sup- 
plementary Fig. 14d, e). 

In conclusion, based on this study and on previous reports, mice 
lacking any member of the ProSAP/Shank family display recurrent 
features observed in animal models for ASD: that is, preserved working 
memory, but increased anxiety and abnormalities in both social inter- 
actions and vocalizations (Supplementary Fig. 15)'°1°?°?. 

In summary, here we demonstrate that altered glutamatergic 
neurotransmission can lead to the core symptoms of ASD. In addition, 
this study shows that ProSAP1/Shank2 and ProSAP2/Shank3 seem to 
serve different interrelated functions at excitatory synapses, especially 
in glutamate receptor targeting/assembly. However, the exact molecular 
mechanisms are still to be deciphered. In any case, our comparative 
analysis of mice lacking either ProSAP1/Shank2 or major isoforms of 
ProSAP2/Shank3 reveals that mutations of very similar proteins within 
the same synaptic pathway can have different molecular consequences 
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(for example, excess or deficit of glutamate receptors), but both lead to 
abnormal social and vocal behaviours. Future studies should tell 
whether gene- or pathway-specific therapies are necessary to modulate 
or even reverse the pathophysiology of ASD. 


METHODS SUMMARY 


Biochemistry, Golgi staining and electron microscopy. A subfractionation pro- 
tocol” was performed to obtain subcellular fractions from brain tissue of juvenile 
and/or adult wild-type, ProSAP1/Shank2 and ProSAP2/Shank3 mutant mice from 
both sexes. After immersion in Golgi-Cox solution for 21 days, adult brains were 
cut in 200 jum sagittal sections to develop Golgi—Cox staining for analysis of spine 
density. Further, adult mice were perfused, brains were dissected out, stained and 
cut in ultrathin sections to be examined by electron microscopy. 
Electrophysiology. Extracellular field and whole-cell patch-clamp recordings 
were performed in horizontal hippocampal slices from mice of both sexes. 
Evoked postsynaptic responses were induced by electrical stimulation of 
Schaffer collaterals in CA1 stratum radiatum. fEPSPs were recorded in stratum 
radiatum. Long-term potentiation was induced by a single tetanus of 100 pulses at 
100 Hz. Long-term depression was induced by 15 min paired pulse stimulation at 
1 Hz with 50 ms between single pulses. mEPSCs, whole-cell AMPA currents and 
inhibitory postsynaptic currents (IPSCs) were recorded in whole-cell patch-clamp 
configuration from CA1 pyramidal cells voltage-clamped at —60 mV. For estima- 
tion of NMDA/AMPA ratios, compound EPSCs were evoked at —60 and 
+40 mV. 
Behavioural analysis. Three cohorts of mice (C57BL/6 background) were tested. 
Cohort 1 included pups for the developmental study to examine pup vocal beha- 
viour, motor coordination, olfaction and developmental milestones. Adult beha- 
viour was tested on cohort 2 in the following order: light-dark anxiety test, open 
field, Y-maze, three-chamber test, self-directed and digging behaviours, resident- 
intruder test, male behaviour in presence of an oestrus female, buried-food finding 
test and object recognition. Cohort 3 was used for the general neurological exam- 
ination, juvenile body weight and to analyse motor coordination. 

All animal procedures were in accordance with institutional, state and govern- 
ment regulations (Tubingen: 0.103; Berlin: LAGeSo, T0100/03; Paris: CEEA Ile- 
de-France Comité 1). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. Two mouse C57BL/6 genomic Xbal DNA fragments coding for exon VI 
and VII of the ProSAP1/Shank2 PDZ domain were subcloned from a bacterial 
artificial chromosome (pBelo-BAC II 21259) yielding a 16.3-kilobase (kb) 
chromosomal DNA fragment. For target vector construction, the long arm, coding 
for exon VI, was cloned as an EcoRI-Xbal fragment (7.7 kb) into pBluescript II SK 
vector (Thermo Scientific) followed by an XbaI-Xhol fragment (1.1 kb) coding for 
exon VII in which a loxP sequence was integrated into the unique SphI site. The frt 
PGK neo frt loxP cassette was cloned into the Xhol site followed by the short arm 
(2.8 kb) as an XhoI-Sphl fragment. The linearized targeting construct was elec- 
troporated into recombinant inbred embryonic stem cells (RI-ES cells) (passage 15 
(129/SV X 129/SV-CP)) at 25 uF and 400 V (Gene Pulser; Bio-Rad). After elec- 
troporation, cells were plated onto culture dishes (100 mm diameter) containing a 
gamma-irradiated monolayer of G418-resistant PMEF cells. Thirty-two hours 
later, 350 ug of G418 (Invitrogen) per millilitre and 0.2 1M 2’-deoxy-2'-fluoro- 
B-p-arabinofuranosyl-5-iodouracil (Moravek Biochemicals and Radiochemicals) 
were added to the culture medium. The medium was replaced every day, and 
colonies were picked and analysed 8 days after plating. Cells were expanded in 
HEPES-buffered Dulbecco’s modified Eagle’s medium supplemented with 15% 
fetal bovine serum (Thermo Scientific), 1,000 U of recombinant leukaemia inhib- 
itory factor (Millipore) per millilitre, non-essential amino acids, L-glutamine, 
B-mercaptoethanol and antibiotics (penicillin 100U ml ', and streptomycin 
100 1g ml~'). For electroporation, 2 X 10’ cells were re-suspended in 20 mM 
HEPES (pH 7.4), 173 mM NaCl, 5mM KCI, 0.7 mM Na,HPO,, 6mM dextrose 
and 0.1mM f-mercaptoethanol. Homologous recombination was tested by 
Southern blot analysis. Embryonic stem cells were transiently transfected with a 
FRT-recombinase plasmid to delete the selection cassette, leaving a single frt site. 
Correctly targeted embryonic stem cells were microinjected into 3.5-day-old 
B6D2F1 blastocysts and transferred to the uteri of 2.5 day pseudopregnant CD1 
females. Pregnant mice carried pups to term and born chimaeras were identified 
by agouti coat colour contribution. For the germ-line transmission, male 
chimaeras were crossed to C57BL/6 female mice. Heterozygous offsprings 
(ProSAP1/Shank2*/") were confirmed by Southern blot analysis and further 
tested by PCR for the presence of the targeted allele. Cre-mediated excision was 
performed in vivo by cross-breeding mice harbouring a CMV promoter driven Cre 
transgene resulting in heterozygous ProSAP1/Shank2-deficient mice (ProSAP1/ 
Shank*’~). ProSAP2/Shank3 mutants were generated by Genoway (Lyon, France) 
and raised on a C57BL/6 background. The targeting strategy is shown in 
Supplementary Fig. 6. All mice were kept in specific pathogen-free animal facilities 
and all mouse procedures were performed in compliance with the guidelines for 
the welfare of experimental animals issued by the Federal Government of 
Germany and further approved by the ethical committee of Ile-de-France 
(CEEA Ile-de-France Comité 1). 

Primary antibodies. Two polyclonal ProSAP1/Shank2 antibodies were used for 
this study. The first one (PRC pab SA6045) was directed against amino acids 355- 
509, the second one (ppI-SAM pab SA5193) against amino acids 826-1259 of rat 
ProSAP1/Shank2. Both antisera were produced in rabbits and guinea pigs and 
have previously been characterized*'. Furthermore, a novel polyclonal ProSAP2/ 
Shank3 antibody was produced (PRC pab). This antibody was directed against 
amino acids 781-1009 and 1260-1392 within the proline-rich clusters (PRC) of rat 
ProSAP2/Shank3. For antibody production, partial complementary DNAs 
(cDNAs) of rat ProSAP2/Shank3 (base pairs 2116-2800 and 3550-3948, respect- 
ively) were each cloned into the bacterial expression vector pGEX-4T (GE 
Healthcare). The corresponding glutathione S-transferase (GST)-ProSAP2/ 
Shank3 fusion proteins were expressed in Escherichia coli BL21, purified and 
injected at the same time in rabbits to generate antiserum. For all experiments 
conducted in this study, both ProSAP1/Shank2 antisera and the ProSAP2/Shank3 
antiserum were each purified against the other family members. For this purpose, 
COS-7 cells were transfected with pEGFP-Shankl, pEGFP-ProSAP1/Shank2 
and/or pEGFP-ProSAP2/Shank3 constructs (all constructs have been described 
previously**). The corresponding fusion proteins were collected, coupled to mag- 
netic beads and loaded onto a column-based system (Miltenyi Biotec). The anti- 
sera were then purified by having them flow subsequently through the appropriate 
columns (the green fluorescent protein (GFP)-Shankl and GFP-ProSAP2/ 
Shank3 columns in case of the ProSAP1/Shank2 antisera, the GFP-Shank1 and 
GFP-ProSAP1/Shank2 columns in case of the ProSAP2/Shank3 antiserum). Two 
different pan-Shank antibodies were further used in this study. The first, pan- 
Shank ‘189.3’ was directed against the PDZ domain of human Shank] and is 
known to detect Shank1 as well as ProSAP1/Shank2 (ref. 33). The second, pan- 
Shank ‘Clone N23B/49’, was purchased from Millipore and directed against the 
SH3/PDZ domains of rat ProSAP1/Shank2 (amino acids 84-309). It is known to 
recognize all three ProSAP/Shank proteins, though. (The antigen sequence is 
approximately 75% identical to the corresponding sequence in ProSAP2/ 
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Shank3 and approximately 70% to that in Shank1.) The following antibodies were 
purchased from commercial suppliers: PSD95 (Abcam), {3-tubulin (Covance), 
GluN2A (NR2A), GluN2B (NR2B) (both Millipore), GKAP/SAPAP, Shank1 (IF) 
(Novus Biologicals), f-actin, GluN1 (NR1), Shankl (WB) (all from Sigma- 
Aldrich), pan-GluA, GluA1, GluA2, GluA3 (all from Synaptic Systems). 
Biochemistry and quantitative immunoblot analyses. To obtain the subcellular 
fractions from mouse brain analysed in this study (homogenates, soluble fractions, 
crude synaptosomal fractions, synaptic plasma membranes, one-triton extracted 
PSD fractions), a subcellular fractionation procedure was performed as described 
previously*® with minor modifications. In brief, tissue from mouse brain of both 
sexes (either brain regions and/or whole brain) was homogenized in HEPES- 
buffered sucrose (320mM sucrose, 5mM HEPES, pH 7.4) containing protease 
inhibitor mixture (Roche). The homogenate was taken for analysis and/or further 
centrifuged at 1,000g for 10 min at 4°C to remove cell debris and nuclei. The 
supernatant was spun for 20 min at 12,000g to obtain the soluble and the crude 
synaptosomal fraction (pellet P2). This fraction was used for the broad analysis of 
scaffold and receptor protein levels throughout various brain regions at two 
developmental stages (P25, P70). For some experiments, a pooled P2 fraction from 
10 whole mouse brains was further fractionated by sucrose density gradient cent- 
rifugation (0.8/1.0/1.2 sucrose) at 200,000g for 2 h at 4 °C. Purified synaptosomes/ 
synaptic plasma membranes were collected at the 1.0-1.2 M interface. To obtain 
the one-triton extracted PSD fraction, synaptosomes were re-suspended in five 
volumes of 1 mM Tris pH 8.1, stirred for 15 min on ice in buffer containing 0.5% 
Triton X-100 and centrifuged at 33,000g for 30 min. Equal amounts of 10-20 pg 
protein per lane were separated by SDS-polyacrylamide gel electrophoresis, 
stained with Coomassie gel staining solution (Thermo Scientific) or blotted onto 
polyvinylidene fluoride or nitrocellulose membranes using standard protocols. 
After incubation with specific primary antibodies, immunoreactivity was visua- 
lized on X-ray film (GE Healthcare) using HRP-conjugated secondary antibodies 
(Dako) and a SuperSignal detection system (Thermo Scientific). For quantifica- 
tion, the films were scanned and the grey value of each band was analysed by 
ImageJ (National Institutes of Health, http://rsb.info.nih.gov/ij/) and normalized 
to the grey value of B-actin and/or B3-tubulin. 

Immunohistochemistry. Immunohistochemical stainings were performed on 
frozen brain sections (51m) of adult mice from both sexes. Ice-cold methanol 
(—20 °C) was used for fixation. Cell membranes were permeabilized with Triton 
X-100 (0.5% Triton X-100 in 10mM PBS) followed by blocking with hydrogen 
peroxide solution (to inactivate endogenous peroxidase) and 2% BSA (to block 
unspecific binding sites). The sections were subsequently incubated with the first 
antibody in 0.5% BSA overnight at 4 °C in the wet chamber. For diaminobenzidene 
staining, the sections were incubated with a biotinylated secondary antibody and 
detection was ascertained using the ABC Vectastain Kit (Vector Laboratories) and 
DAB solution (0.6% DAB, 0.1% hydrogen peroxide in 5mM Tris/HCl, pH 7.4). 
For immunofluorescence staining, the sections were incubated with an Alexa 
Fluor 568 (red) fluorescence labelled secondary antibody (Invitrogen) and cell 
nuclei were counterstained with 4,6-diamidino-2-phenylindole (DAPI). 

Golgi staining. Dissected adult mouse brains from both sexes were immersed for 
21 days in Golgi-Cox solution (1% potassium dichromate, 1% mercuric chloride, 
8% potassium chromate). The brains were subsequently dehydrated and sagittal 
sections (200 j1m) were cut using a vibratome. Golgi-Cox staining was developed 
by incubation in 16% ammonia for 30 min before embedding. Z-stack images were 
taken using an upright Axioscope (Zeiss). For quantitative analysis of spine 
density, we analysed at least three neurons (pyramidal neurons from CA1 
hippocampus) from at least three independent wild-type and ProSAP1/ 
Shank2-‘~ or wild-type and ProSAP2/Shank3ap ~/~ littermate pairs. 

Electron microscopy. Adult mice from both sexes were transcardially perfused 
with fixative (2% paraformaldehyde, 2.5% glutaraldehyde, 1% saccharose in 0.1 M 
cacodylate buffer, pH 7.3) and brains were dissected out and postfixed overnight 
(2% glutaraldehyde, 1% saccharose in 0.1 M cacodylate buffer). After dehydration 
and staining with 2% uranyl acetate, the material was embedded in epoxy resin. 
Ultrathin sections were cut using an ultramicrotome. After lead citrate staining, 
the sections (CA1 hippocampus) were examined using an EM 10 electron micro- 
scope. For quantitative analysis of PSD length and thickness, we analysed at least 
three independent wild-type and ProSAP1/Shank2 ~‘~ or wild-type and ProSAP2/ 
Shank3af ~/~ littermate pairs. 

Real-time quantitative PCR. Isolation of total RNA from mouse brain tissue was 
performed using the RNeasy kit as described by the manufacturer (Qiagen). 
Isolated RNA was eluted in 20 pl of RNase-free water and stored at —80 °C. For 
the reverse transcriptase (RT)-mediated PCR studies, first strand synthesis and 
real-time quantitative RT-PCR amplification were performed in a one-step, 
single-tube format using the QuantiFast SYBR Green RT-PCR kit (Qiagen). 
Thermal cycling and fluorescent detection were performed using the Rotor- 
Gene-Q real-time PCR machine (model 2-Plex HRM) (Qiagen). The qRT-PCR 
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was assayed in 0.1 ml strip tubes in a total volume of 20 ul reaction mixture 
containing 1 il of undiluted total RNA, 2 ul of QuantiTect Primer Assay oligonu- 
cleotides, 10 ,l of 2X QuantiFast SYBR Green RT-PCR Master Mix supplemented 
with ROX (5-carboxy-X-rhodamine) dye, 6.8 il of RNase-free water (supplied 
with the kit) and 0.2 yl of QuantiFast RT Mix. RT. Amplification conditions were 
as follows: 10 min at 50 °C and 5 min at 95 °C, followed by 40 cycles of PCR for 10s 
at 95°C for denaturation, 30s at 60°C for annealing and elongation (one-step). 
During the extension, real-time fluorescence measurements were recorded by the 
PCR machine, thus monitoring real-time PCR amplification by quantitative ana- 
lysis of the fluorescence emission. The SYBR Green I reporter dye signal was 
measured against the internal passive reference dye (ROX) to normalize non- 
PCR-related fluctuations in fluorescence that occur from reaction tube to reaction 
tube. Resulting data were analysed using the hydroxymethylbilane synthase gene as 
an internal standard to normalize transcript levels. Cycle threshold (ct) values were 
calculated by Rotor-Gene-Q Software (version 2.0.2). Cycle threshold values indi- 
cate the PCR cycle number at which the measured fluorescence of the indicator dye 
(SYBR Green I), accordant to the quantity of amplified PCR products, is increasing 
in a linear fashion above background. All qRT-PCR reactions were run in dupli- 
cates in three independent experiments, and mean ct values for each reaction were 
taken into account for calculations of data analysis. To ascertain primer specificity, 
a melting curve was obtained for the amplicon products to determine their melting 
temperatures. Melting curve was driven from 60 °C to 95°C rising in 1 °C steps 
whereas fluorescence was recorded continuously. For negative controls and to 
check for reagent contamination, a complete reaction mixture was used in which 
the RNA sample was replaced by RNase-free water. Real-time quantitative PCR 
was performed using oligonucleotides to investigate expression of ProSAP1/Shank2 
and ProSAP2/Shank3 (validated primer pairs, QuantiTect primer assay, Qiagen) in 
tissue from wild-type littermates, ProSAP1/Shank2*’-, ProSAP1/Shank2'~ and 
ProSAP2/Shank3ah‘~ mutants. All consumables used for the extraction of total 
RNA and real-time PCR analysis were purchased from Qiagen. 

Primary hippocampal cultures from rat. Primary cell culture of rat hippocampal 
neurons (embryonic day 18 (E18)) was performed as described previously”. 
Hippocampal neurons were seeded on poly-L-lysine (0.1mgml_'; Sigma- 
Aldrich) coated coverslips at a density of 3 X 10* cells per well. Cells were grown 
in Neurobasal medium, complemented with B27 supplement, 0.5mM 
L-glutamine, and 100 Uml? penicillin/streptomycin (all Invitrogen) and main- 
tained at 37 °C in 5% CO). Neurons were transfected with ProSAP1/Shank2 short 
hairpin RNA (shRNA) (sequence: TG CCT TCA CCA AGA AGG AA) and/or 
scrambled control constructs* at 12 days in vitro (DIV 12). After fixation at DIV14 
and immunostaining with the appropriate antibodies, pictures were taken with an 
upright Axioscope microscope equipped with a Zeiss CCD camera. Quantification 
of fluorescence signals was performed using ImageJ (National Institutes of Health, 
http://rsb.info.nih.gov/ij/). For evaluation, fluorescent puncta (ProSAP2/Shank3, 
Shank1, GluN1, GluN2A, GluN2B, GluA1, GluA2 and GluA3) along dendrites 
within the field of view were analysed and the signal intensity values of GFP co- 
localizing (shRNA-ProSAP1/Shank2 or scrambled RNAi expressing cells) and 
non-GFP co-localizing puncta compared. 

Statistical analysis for the methods mentioned above was performed using 

Microsoft Excel for Macintosh and data were tested for significance using two- 
tailed, Student’s t-test and ANOVA. P values < 0.05 were stated as significant 
(*P < 0.05, **P<0.01, ***P< 0.001). 
Electrophysiology. ProSAP1/Shank2 mutants were raised on a C57BL/6 back- 
ground and wild-type littermates were used as a control in all experiments. The 
experimenters were blind to the genotype of the tested animals for data collection 
and analyses. Hippocampal slices were prepared from animals of both sexes aged 
P21-P28 (unless indicated otherwise), and as previously described™. Briefly, mice 
were anaesthetized with isoflurane and decapitated. Brains were rapidly removed 
and transferred to ice-cold artificial cerebrospinal fluid (ACSF) slicing solution 
containing (in mM): 87 NaCl, 26 NaHCO3, 50 sucrose, 25 glucose, 2.5 KCl, 1.25 
NaH>PO,, 3 MgCl, 0.5 CaCl, pH 7.4. Tissue blocks containing the hippocampus 
were mounted on a Vibratome (Leica VT1200) and cut into horizontal slices of 
300 pum. Slices were incubated in slicing solution at 35°C for 30 min, cooled to 
room temperature and transferred to ACSF containing (in mM): 119 NaCl, 26 
NaHCOs, 10 glucose; 2.5 KCl, 2.5 CaCl, 1.3 MgCl, 1 NaH2POy,. All ACSF was 
equilibrated with carbogen (95% O2, 5% COz). Slices were stored under sub- 
merged conditions for 30min to 6h before being transferred to a submerged 
recording chamber (Luigs and Neumann) where they were perfused with ACSF 
maintained at room temperature at a rate of 3-4 ml min |. 

Extracellular field and whole-cell patch-clamp recordings were performed. 
Stimulation and recording pipettes were pulled from borosilicate glass capillaries 
(Harvard Apparatus; 1.5mm outside diameter, with a micropipette electrode 
puller (DMG Universal Puller)). Evoked postsynaptic responses were induced 
by stimulating Schaffer collaterals (0.1 Hz) in CA1 stratum radiatum with pipettes 


of 201m tip diameter (filled with ACSF), using a stimulus isolater (ISO-flex, 
A.M.P.L). fEPSPs were recorded with the same kind of pipettes that were placed 
in stratum radiatum. fEPSP slopes were determined as dV/dt of the 20-80% 
amplitude from averages of five individual traces. Long-term potentiation was 
induced by a single tetanus of 100 pulses at 100 Hz. For long-term depression 
experiments, CA1 was isolated by a microcut set at the border of CA2/CA3 
immediately before the experiment, and recordings were made in the presence 
of 1M gabazine. Long-term depression was induced by 15 min paired-pulse 
stimulation at 1 Hz with 50 ms between single pulses. For whole-cell patch-clamp 
recordings, pipettes had resistances of 2-3 MQ. CA1 pyramidal cells were held at 
—60 mV in voltage-clamp mode (not corrected for liquid junction potential). 
Series resistance (not compensated) was constantly monitored and was not 
allowed to increase beyond 22 MQ or change more than 20% during the experi- 
ment. mEPSCs and whole-cell AMPA currents were recorded in the presence of 
1M tetrodotoxin, 0.1mM cyclothiazide and 1 1M gabazine with a potassium- 
based intracellular recording solution containing (in mM): 135 K-gluconate, 10 
HEPES, 0.5 EGTA, 20 KCl, 2 MgATP and 5 phosphocreatine. Osmolarity was 
300 mOsm; pH was adjusted to 7.2 with KOH. mEPSC events were detected with a 
threshold-based algorithm and their amplitudes calculated from a 1 ms time win- 
dow around the peak. Events were aligned by their rise time before averaging. The 
mEPSC frequency was determined from a 3 min time window. The analysis of 
mEPSC amplitudes included the first 42 events of each recorded cell (one cell with 
fewer events was excluded from amplitude analysis). Cellular input resistance was 
calculated from the steady-state current measured in response to a hyperpolarizing 
test pulse of 50 ms duration at a holding potential of -60 mV. NMDA/AMPA 
ratios were recorded in ACSF containing 4mM CaCl, 4mM MgCl, and 11M 
gabazine with a caesium-based intracellular recording solution containing (in 
mM): 145 CsCl, 10 HEPES, 0.2 EGTA, 2 MgCl, 2 NaATP, 0.5 NaGTP and 5 
phosphocreatine. Osmolarity was 305 mOsm; pH was adjusted to 7.2 with CsOH. 
For estimation of NUDA/AMPA ratios, compound EPSCs were evoked at —60 
and +40 mV. Stimulation was adjusted to produce a single-peaked short-latency 
response. Ten consecutive EPSCs for each holding potential were averaged. The 
AMPA receptor-mediated component of the EPSC was estimated by measuring 
the peak amplitude of the averaged EPSC at —60 mV. The NUDAR-mediated 
component was estimated at +40 mV by measuring the amplitude of the averaged 
EPSC 75 ms after stimulation. IPSCs were recorded in the presence of 10 1M 
NBQX and 10M APV with a potassium-chloride-based intracellular recording 
solution containing (in mM): 145 KCl, 10 HEPES, 0.1 EGTA, 2 MgCl), 2 Na,ATP. 
Osmolarity was 305 mOsm; pH was adjusted to 7.3 with KOH. For mIPSC record- 
ings, 1 j.M tetrodotoxin was added to the bath. The IPSC frequency was deter- 
mined from a 2 min time window. The analysis of mIPSC (sIPSC) amplitudes 
include the first 120 (260) events of each recorded cell. All drugs were purchased 
from Tocris Bioscience. 

Recordings were performed with an Axopatch 700A Amplifier (Axon 
Instruments). Data were acquired using a BNC-2090 adaptor chassis, digitized 
at 5kHz (PCI 6035E A/D Board, National Instruments), filtered at 1 kHz and 
recorded in IGOR Pro 4.0 using custom made plug-ins (WaveMetrics). 

Analyses were performed using custom-written procedures in IGOR Pro 
(WaveMetrics) and MATLAB (The Mathworks). mEPSCs were analysed with 
NeuroMatic (http://www.neuromatic.thinkrandom.com). Data in graphs and the 
text are presented as mean + s.e.m. Statistical comparisons between groups were 
performed in GraphPad Prism (GraphPad Software) with unpaired two-tailed 
Student’s t-tests and two-way repeated-measures ANOVA. A Kolmogorov— 
Smirnov test was performed in MATLAB. Results were considered significant at 
P<0.05. Stimulus artefacts were blanked in sample traces. Sample sizes are given as 
number of experiments (n) and number of animals (N). Data in bar graphs are 
presented as mean with standard error; grey circles show individual data points. 
Behavioural studies. The behavioural studies included several cohorts of mice. 
Cohort 1 (backcrossed for 10 generations on C57BL/6) was used for the develop- 
mental study of pups. Cohort 2 included 3- to 6-month-old adult mice (back- 
crossed for 11 generations on C57BL/6). The general neurological examination 
and the rotarod experiments were conducted with cohort 3 of 6- to 8-month-old 
adult mice (backcrossed for 11 generations on C57BL/6). We tested the offspring 
(that is, wild-type, ProSAP1/Shank2*’~ and ProSAP1/Shank2 ‘~ littermates) of 
ProSAPI/Shank2*’~ mice crossings. The experimenters were blind of the geno- 
type of the tested animals for data collection and analyses. 

General parameters indicative of the health and neurological state were 
addressed following the neurobehavioural examination described by Whishaw 
et al.** and the tests of the primary screen of the SHIRPA protocol except startle 
response’. For the developmental study, males were separated from pregnant 
females 1 or 2 days before birth. Births were checked each morning and evening. 
Pups of both sexes (cohort 1) were individually identified with long-lasting sub- 
cutaneous tattoos (green tattoo paste, Ketchum Manufacturing) on the paws on 
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P1. Each pup was isolated from dam and littermates and placed in a small enclosure 
with soft plastic surface in a soundproof chamber (temperature T= 24 + 1 °C). 
Audio recordings lasted 5 min and were conducted every 2 days (P2, P4, P6, P8, P10 
and P12). Recording hardware (UltraSoundGate 416-200, condenser ultrasound 
microphone Polaroid/CMPA) and software (Avisoft SASLab Pro Recorder) were 
from Avisoft Bioacoustics (sampling frequency 300 kHz; fast Fourier transform 
length 1,024 points; 16-bit format). After each audio recording, developmental 
milestones (weight, opening of the eyes, extroversion of the ears and incisive erup- 
tion) were noted. Tests for general sensory perception and motor coordination 
(righting reflex (P2-P12), home cage odour preference (P7), reaction to a click 
sound from a pen 2 cm behind the ears (P14)) were conducted at least 30 min after 
audio recordings. 

Righting reflex. The pup was turned on its back ona flat surface. The latency until 
it reached a normal position (face down) was measured. When the pup did not 
succeed in righting up, latency was set at the maximum time allowed (120s). 
Preference for home cage odour. A plastic cage (20cm X 30cm X 6 cm) was 
divided in three zones. One side of the cage was covered with bedding from the 
nest cage; the other side was covered with fresh bedding. A neutral zone (width 
2.5 cm) separated these two zones. The 7-day-old pup was placed in the middle of 
the neutral zone. Time spent by the nose of the pup in each zone was measured 
within 60 s. This test was repeated for each pup and the mean of the two tests was 
used in the analyses. 

To examine adult behaviour, mice of both sexes aged between 3 and 6 months 
(cohort 2) were tested in the following order: light-dark anxiety test, open field, 
Y-maze, three-chamber test, self-directed and digging behaviour, resident-intruder 
test, male behaviour in presence of an oestrus female, buried-food finding test and 
object recognition. All animals were weighed at 4 months of age, and their weight was 
compared with age- and sex-matched commercially available C57BL/6 mice. At the 
same age, mice were held by the tail 20 cm above a table for approx. 1 min to check for 
hindlimb clasping. Males were housed individually from weaning onwards, given 
their high aggressiveness towards each other. Females were housed in groups (except 
for the resident-intruder test; see hereafter). At least 2 days elapsed between two 
consecutive tests. Unless otherwise specified, we tested and analysed 16 males per 
genotype and 16 females per genotype (but only 13 ProSAP1/-Shank2 ‘~ females). 
Settings were cleaned with soap and water, and dried with paper towels between each 
mouse. When bedding was used, new fresh bedding was used for each mouse. Tests 
using audio-recordings were conducted in a soundproof chamber. 

Light-dark anxiety test. The setting consisted of a white, brightly illuminated 
compartment (1300 1x) connected to a black, very dark one (3 1x) through a small 
door. The tested mouse was introduced into the white compartment and allowed 
to explore the whole apparatus for 5 min. The latency to enter the dark compart- 
ment, the time spent in each compartment and the number of transitions between 
compartments were measured. 

Open field. The tested mouse was allowed to explore freely for 30 min a round, 
open-field arena of 1m diameter (100 1x in the centre of the arena). Automatic 
detection of the mouse (custom software of P. Faure) recorded the total distance 
travelled, and the time spent in the central zone versus time spent at the periphery of 
the maze. 

Y-maze. The tested mouse was allowed to explore freely a Y-shaped labyrinth for 
3 min. The number of visits in each arm of the labyrinth and sequences of entrance 
in each arm were recorded. 

Three-chamber test. A Plexiglas cage was divided in three compartments as 
previously described’’. Both side compartments contained an empty perforated 
cup (side compartments: 1501x; central compartment: 140 Ix). First, the tested 
mouse was allowed to explore freely the whole setting, with all doors open for 
10 min (phase 1). After this habituation period, the mouse was restricted in the 
central compartment, while an unfamiliar C57BL/6 mouse of the same sex 
(stranger 1) was placed under one of the cups (sides alternated between each 
mouse). The tested mouse was then allowed to explore the whole apparatus for 
10 min (phase 2). After that, it was restricted to the central compartment while 
another unfamiliar C57BL/6 mouse of the same sex (stranger 2) was placed under 
the other cup. The tested mouse could then again freely explore the whole appar- 
atus for 10 min (phase 3). In all three phases, time spent in each compartment and 
number of transitions between compartments were automatically recorded. Times 
spent in contact with the cup containing the mouse (stranger 1) and the empty cup 
were manually measured in phase 2. Times spent in contact with the cup contain- 
ing the unfamiliar mouse (stranger 2) and in contact with the cup containing the 
familiar mouse (stranger 1) were manually measured in phase 3. 

Self-directed and digging behaviours. The tested mouse was placed in a new test 
cage (Plexiglas, 50cm X 25cm X 30cm; 1001x; clean sawdust bedding) in a 
soundproof chamber. After 10 min habituation, its behaviour was video-recorded 
for 10 min. We recorded the time spent self-grooming and digging in the bedding, 
as well as the number of self-grooming and digging bouts. 
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Resident-intruder test. Female mice were housed individually 3 days before the 
testing day to increase their social motivation. Males were reared in social isolation 
from weaning onwards (see above). The tested mouse was left 30 min for habituation 
in the test cage (Plexiglas, 50 cm X 25 cm X 30cm; 100 1x; clean sawdust bedding; 
soundproof chamber**). After this time, an unfamiliar C57BL/6 mouse of the same 
sex was introduced. The two animals were allowed to interact freely for 4 min. Social 
interactions were videotaped continuously (high-resolution Sony XCD-SX90CR 
video camera). At the same time, ultrasonic vocalizations were recorded with a 
condenser ultrasound microphone Polaroid/CMPA, the interface UltraSoundGate 
416-200 and the software Avisoft SASLab Pro Recorder from Avisoft Bioacoustics 
(sampling frequency: 300 kHz; fast Fourier transform length: 1024 points; 16-bit 
format). Behaviours were encoded manually. We recorded the latency for the first 
contact, the time spent in contact, as well as the latency for the first ultrasonic 
vocalization, the call rate and the distribution of the different call types. 
Male-female interactions. All males had a previous contact of 3 days with a 
female at least 3 days before the experiment. The tested male was left in a test cage 
(Plexiglas, 50cm X 25cm X 30cm; 1001x; clean sawdust bedding; soundproof 
chamber) to habituate for 10 min. An unfamiliar C57BL/6 female in oestrus (tested 
though vaginal smears) was then introduced. Ultrasonic vocalizations and social 
interactions were recorded for 3 min with the same setting as cited above. The 
latency for the first contact, the time spent in contact, as well as the latency for the 
first ultrasonic vocalization, the call rate and the distribution of the different call 
types, were recorded. 

Buried-food finding test. Four days before testing, female mice were housed 
individually, like males. Each day, all mice were given four pieces of cocoa- 
flavoured crisped rice cereals (Coco Pops, Kellogg’s) with restricted access to their 
usual food (SAFE) but water ad libitum. Mice that did not eat their cocoa- 
flavoured cereals were excluded from the test. Twelve hours before testing, mice 
were food-deprived, with water ad libitum. On the day of the test, six pieces of 
Coco Pops were buried 1.5 cm under sawdust bedding in a clean cage. The tested 
mouse was placed in the opposite corner. The latency for the mouse to retrieve the 
food was measured (maximum time: 3 min). For each animal tested, a new clean 
cage with fresh bedding was used. 

Object recognition. To test ProSAP1/Shank2 mutants’ ability to differentiate 
objects, we used the set up of the three-chamber test. In the habituation phase, the 
mouse was allowed to explore the whole setting freely, without any object inside. 
After that, the mouse was restricted in the central compartment and one identical 
electric socket was placed in each side compartment. The mouse was again allowed to 
explore the setting freely with the two identical objects for 10 min. After this second 
period, while the mouse was restricted in the central compartment, one of the sockets 
was replaced by a copper joint. Again, the mouse was allowed to explore the complete 
setting for 10 min. In all three phases, the time spent in each compartment and the 
number of transitions between compartments was automatically recorded. Time 
spent in contact with each object was manually measured in each phase. 

Rotarod test. Animals of cohort 3 received two training sessions (3 h interval) ona 
rotarod apparatus (TSE Systems) with increasing speed from 4 to 40 r.p.m. for 5 
min. After 4 days, mice were tested at 16, 24, 32 and 40 r.p.m. constant speed. The 
latency to fall off the rod was measured””. 

Analyses of audio recordings. In the pup developmental study, given that there 
were no significant differences between the number of ultrasonic vocalizations 
automatically detected (pulse train detection analyses from Avisoft SASLab Pro; 
hold time 7 ms) and the number of calls manually detected (data not shown), we 
used the automatic detection of calls to compare the global call rate between 
genotypes for each age. For vocalizations recorded in adult animals, we manually 
detected the calls in the software Avisoft SASLab Pro (Avisoft; 75% overlap; time 
resolution 0.853 ms; frequency resolution 293 Hz; Hamming window). 

For pups and adults, we labelled the calls with the labelling function of Avisoft 
SASLab Pro. Each call was classified in one call category of 11, adapted from ref. 40 as 
follows. (1) Short: duration shorter than 5 ms; frequency range = 6.25 kHz. (2) Flat: 
duration longer than 5 ms and frequency range = 6.25 kHz. (3) Upward: increase in 
frequency; frequency range greater than 6.25 kHz with only one direction of fre- 
quency modulation. (4) Downward: decrease in frequency; frequency range greater 
than 6.25 kHz with only one direction of frequency modulation. (5) Modulated: 
frequency modulations in more than one direction; frequency range greater than 
6.25 kHz. (6) Complex: addition of one or more frequency component (not 
harmonic). (7) One frequency jump: inclusion of one jump in frequency without 
time gap between the two frequency components. (8) Frequency jumps: inclusion of 
more than one jump in frequency without time gaps between the two consecutive 
frequency components. (9) Mixed: inclusion ofa noisy (‘unstructured’) part within a 
pure tone call. (10) Unstructured: no pure tone component identifiable; ‘noisy’ calls. 
(11) Others: include all the calls that did not fit in any of the preceding categories (for 
example, calls combining features of several of the previous call types). 
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Unless otherwise specified, we used Mann-Whitney U-tests to compare the 


behavioural traits measured between genotypes or within genotypes because of the 
non-normal distribution of the data and the limited sample size in many cases. We 
compared the distribution of the different call types between genotypes using 7 
tests. All analyses were conducted with the computing and statistical software R (R 
Developmental Core Team 2009) and data are presented as mean + s.e.m. For 
exact values of statistical analyses, see Supplementary Tables. 
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Autistic-like social behaviour in Shank2-mutant 
mice improved by restoring NMDA receptor function 


Hyejung Won!**, Hye-Ryeon Lee**, Heon Yung Gee**, Won Mah**, Jae-Ick Kim**, Jiseok Lee'?, Seungmin Ha’”, 
Changuk Chung!?, Eun Suk Jung’, Yi Sul Cho°®, Sae-Geun Park", Jung-Soo Lee*, Kyungmin Lee®, Daesoo Kim', Yong Chul Bae’, 


Bong-Kiun Kaang*’, Min Goo Lee* & Eunjoon Kim'**? 


Autism spectrum disorder (ASD) is a group of conditions charac- 
terized by impaired social interaction and communication, and 
restricted and repetitive behaviours. ASD is a highly heritable 
disorder involving various genetic determinants’. Shank2 (also 
known as ProSAP1) is a multi-domain scaffolding protein and 
signalling adaptor enriched at excitatory neuronal synapses” *, 
and mutations in the human SHANK2 gene have recently been 
associated with ASD and intellectual disability’. Although ASD- 
associated genes are being increasingly identified and studied 
using various approaches, including mouse genetics***, further 
efforts are required to delineate important causal mechanisms with 
the potential for therapeutic application. Here we show that Shank2- 
mutant (Shank2~'~) mice carrying a mutation identical to the 
ASD-associated microdeletion in the human SHANK2 gene exhibit 
ASD-like behaviours including reduced social interaction, reduced 
social communication by ultrasonic vocalizations, and repetitive 
jumping. These mice show a marked decrease in NMDA (N-methyl- 
D-aspartate) glutamate receptor (NMDAR) function. Direct stimu- 
lation of NMDARs with p-cycloserine, a partial agonist of NUDARs, 
normalizes NMDAR function and improves social interaction in 
Shank2~’~ mice. Furthermore, treatment of Shank2~/~ mice witha 
positive allosteric modulator of metabotropic glutamate receptor 5 
(mGluR5), which enhances NMDAR function via mGluR35 activa- 
tion’’, also normalizes NMDAR function and markedly enhances 
social interaction. These results suggest that reduced NMDAR func- 
tion may contribute to the development of ASD-like phenotypes in 
Shank2"'~ mice, and mGluR modulation of NMDARs offers a 
potential strategy to treat ASD. 

Mutations in the SHANK2 gene have recently been identified in indi- 
viduals with ASD and intellectual disability*'*. Among these mutations, 
one de novo SHANK2 microdeletion found in ASD leads to loss of exons 
6 and 7 and a frame shift, with concomitant removal of the PDZ and 
following domains in SHANK2 proteins. To explore the possibility that 
this deletion causes ASD in humans, and to study the mechanisms 
underlying the development of ASD, we generated transgenic mice 
carrying a mutation identical to the human microdeletion (exons 6 
and 7 deletion and a frame shift), which affects both splice variants of 
Shank2 (Shank2a and Shank2b) in mice (Fig. 1a). The deletion was 
verified by Southern blotting and various PCR methods (Supplemen- 
tary Fig. 1). Shank2 proteins were undetectable in the brain (Fig. 1b), and 
there were no compensatory increases in Shankl or Shank3 
(Supplementary Fig. 1). The Shank2-'~ mice showed normal reproduc- 
tion and brain structure (Supplementary Fig. 2). 

We first examined whether Shank2 '~ mice displayed autistic-like 
impairments in social interaction. In a home-cage social interaction 


assay, Shank2 ‘~ mice showed reduced interaction with normal target 
mice, as compared with wild-type animals (Supplementary Fig. 3). Ina 
three-chamber social interaction assay, wild-type animals preferred to 
explore the first novel mouse introduced (stranger 1) over an inan- 
imate object relatively more than did Shank2~'~ mice (Fig. 1c, d and 
Supplementary Fig. 4). Next, when the object was replaced by another 
novel mouse (stranger 2), Shank2~'~ mice preferred to explore 
stranger 2 over stranger 1, similar to wild-type animals (Fig. le), 
indicative of normal levels of social novelty recognition. Similar 
results were obtained when we used juvenile Shank2-'~ mice 
(Supplementary Fig. 5). Shank2~'~ mice had normal olfactory func- 
tion (Supplementary Fig. 6). 

Shank2~'~ mice showed impaired spatial learning and memory in 
the Morris water maze, although novel object recognition memory was 
normal (Supplementary Fig. 7). These results suggest that Shank2-'~ 
mice have partially impaired learning and memory, consistent with the 
idea that the deletion in exons 6 and 7 in humans causes ASD and mild 
to moderate intellectual disability”. 

Shank2~'~ mice showed impairments in social communication by 
ultrasonic vocalizations (USVs). When allowed to interact with a novel 
wild-type female mouse, Shank2~'~ male mice uttered USVs less fre- 
quently than did wild-type animals, and took longer to make the first 
call (Fig. 1f-h). In a pup retrieval assay, Shank2~‘~ female mice 
retrieved the pups less efficiently than did wild-type mice (Fig. li and 
Supplementary Fig. 8, Supplementary Movies 1 and 2). 

Shank2~‘~ male animals exhibited other autistic-like abnormalities. 
When kept alone in stranger-free home cages, Shank2~'~ mice 
showed enhanced jumping mostly mixed with upright scrabbling, 
normal grooming, and decreased digging behaviours (Fig. 1j and 
Supplementary Movies 3-5). Shank2-"~ mice also displayed impaired 
nesting behaviour, hyperactivity in assays including the open field test, 
anxiety-like behaviour in an elevated plus maze, and increased groom- 
ing in a novel object recognition arena (Supplementary Figs 4, 6 and 7). 
Shank2~‘~ female mice showed similar repetitive jumping, hyperac- 
tivity in an open field, and anxiety-like behaviour in an elevated plus 
maze, although not in a light-dark box (Supplementary Fig. 8). These 
data collectively suggest that Shank2~'~ mice show ASD-like beha- 
viours. It should be noted that the hyperactivity and anxiety-like beha- 
viours might contribute to the impaired social interaction in 
Shank2~‘~ mice by limiting target exploration or evoking anxiety-like 
responses. 

We also characterized heterozygous Shank2 (Shank2*'~) mice, 
because human gene mutations are mostly heterozygotic. Shank2*/~ 
mice showed hyperactivity, similar to Shank2-‘~ mice (Supplementary 
Fig. 9). However, they showed no abnormalities in social interaction, 
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Figure 1 | Shank2~/~ mice exhibit ASD-like impaired social interaction and 
social communication, and repetitive jumping. a, Targeting of the Shank2 
gene in mice. Ex, exon. b, Shank2~/~ brain lacks expression of both Shank2a 
and Shank2b splice variants. Mo, mock; $2, Shank2; S2E, epithelial form of 
Shank2. c-e, Impaired social interaction of Shank2~/~ mice in three-chamber 
assays. KO, knockout; WT, wild type. d, Social preference (object versus 


repetitive behaviours, or anxiety-like behaviours, reflecting intrinsic 
differences between humans and mice. 

Shank2 is an important regulator of excitatory synaptic structure 
and function? *””. Shank2 deletion, however, had minimal effects on 
excitatory or inhibitory synapses (Supplementary Fig. 10). In addition, 
electron microscopy revealed that excitatory synapse number and 
postsynaptic density morphology were unaltered (Supplementary 
Fig. 11). 

We next measured synaptic transmission at hippocampal Schaffer- 
collateral-CA 1-pyramidal (SC-CA1) synapses. Basal excitatory trans- 
mission such as input-output and paired-pulse ratio was unchanged in 
Shank2~'~ mice (Fig. 2a, b). In addition, spontaneous transmission 
and membrane excitability were normal in mutant animals (Sup- 
plementary Fig. 12). When synaptic plasticity was tested, long-term 
potentiation (LTP) induced by high-frequency stimulation or theta- 
burst stimulation was severely impaired in Shank2-'~ mice (Fig. 2c 
and Supplementary Fig. 13). Long-term depression (LTD) induced 
by low-frequency stimulation was completely abolished also in 
Shank2~'~ mice (Fig. 2d). Because LTD induced by low-frequency 
stimulation activates both NMDARs and mGluRs”, we isolated 
mGluR LTD by bath-applying (RS)-3,5-dihydroxyphenylglycine 
(DHPG), an agonist of mGluR5, but found no difference between 
genotypes (Supplementary Fig. 13). This suggests that the observed 
reductions in LTP and LTD may be due to NMDAR hypofunction. 

We thus measured the NMDA/AMPA (a-amino-3-hydroxy-5- 
methyl-4-isoxazole propionic acid) ratio at Shank2~'~ SC-CAI 
synapses. Indeed, the NUDA/AMPA ratio was reduced relative to 
wild-type synapses (Fig. 2e). Meanwhile, both the decay kinetics of 
NMDAR excitatory post-synaptic currents (EPSCs) and GluN2B- 
mediated EPSCs (GluN2B also known as NR2B or Grin2b) were indis- 
tinguishable between genotypes, suggesting that GluN2A- (also 
known as NR2A or Grin2a) and GluN2B-containing NMDARs were 
equally affected (Supplementary Fig. 14). Given that AMPA receptor 
(AMPAR)-mediated transmission is normal (Fig. 2a), these results 
suggest that NNUDAR-mediated transmission is selectively decreased. 
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stranger 1 (S1-O)). e, Social novelty recognition (stranger 1 versus stranger 2 
(S2-S1)). 1 = 11 (WT), 10 (KO). fh, Impaired social communication by USVs 
in Shank2-’~ mice. n = 10 (WT), 10 (KO). i, Impaired pup retrieval in 
Shank2~/~ mice. n = 5(WT),5 (KO). j, Stereotypical behaviours in Shank2-/— 
mice. n = 11 (WT), 11 (KO). *P < 0.05, **P < 0.01, ***P < 0.001, NS, not 
significant. Data represent mean + standard error. 


The NMDA/AMPA ratio in the medial prefrontal cortex, however, 
was unaltered in Shank2'~ mice (Supplementary Fig. 14), suggesting 
that the reduced NUDA/AMPA ratio is not a change uniformly occur- 
ring in all brain regions. 

Shank2 deletion may also affect NMDAR-associated signalling that 
critically regulates various synaptic events including LTP and LTD’!”. 
In immunoblot analyses, phosphorylation but not total levels of 
CaMKII-c/B (1286), ERK1/2 (p42/44) and p38 were significantly 
reduced in the Shank2~'~ brain (Supplementary Fig. 15). A similar 
decrease was observed in phosphorylation of the AMPAR subunit 
GluA1 (S831 and S845). There were no changes in phosphorylation 
of PAK1/3 and mTOR, total levels of glutamate receptors (GluN2A, 
GluA2 (also known as GluR2 or Gria2) and mGluR1/5 (also known as 
Grm1/5)), or total levels of synaptic scaffolds and signalling adaptors/ 
proteins directly or indirectly associated with Shank2 including PSD- 
95 (also known as Dlg4), SAP97 (also known as Dlg1), GKAP (also 
known as Dlgap1), SynGAP1, Homer1l, Arhgef6/7 (also known as 
a/BPIX), GIT1 and PLC-B3. The increase in GluN1 expression may 
reflect a compensatory increase. These results suggest that Shank2 
deficiency leads to impairments in NMDAR-associated signalling. 

Reduced NMDAR function and associated signalling may con- 
tribute to ASD-like behaviours in Shank2~'~ mice. To test this hypo- 
thesis directly and restore NMDAR function, we used D-cycloserine, a 
partial agonist at the glycine-binding site of NMDARs, which has been 
shown to rescue repetitive grooming in neuroligin-1-deficient mice 
associated with a reduced NMDA/AMPA ratio”. We found that 
p-cycloserine fully recovered the NUDA/AMPA ratio (Fig. 3a). In addi- 
tion, D-cycloserine-treated Shank2~'~ mice showed improved social 
interaction in three-chamber social interaction assays (Fig. 3b-d and 
Supplementary Fig. 16). 

To explore further the association between reduced NMDAR 
function and ASD-like behaviours in Shank2~‘~ mice, we used 
3-cyano-N-(1,3-diphenyl-1H-pyrazol-5-yl)benzamide (CDPPB), a 
membrane-permeable positive allosteric modulator of mGluR5, which 
increases the responsiveness of mGluR5 to glutamate and enhances 
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Figure 2 | Impaired NMDAR-dependent synaptic plasticity in Shank2~/~ 
mice. a, Normal input-output curve at hippocampal SC-CA1 synapses in 
Shank2~/~ mice. n = 7 (WT), 8 (KO). fEPSP, field excitatory postsynaptic 
potential. b, Normal paired-pulse ratio. n = 10 (WT), 8 (KO). ¢, Impaired LTP. 


NMDAR function'”**’°. CDPPB has antipsychotic and pro-cognitive 
activities'’**7°°8, and facilitates behavioural flexibility”. In addition, 
CDPPB restores reduced excitatory transmission and ERK phosphor- 
ylation caused by Shank3 knockdown”, and CDPPB and its derivative 
(VU-29) enhance both LTP and LTD, and spatial learning”. 
Consistent with previous findings, CDPPB also normalized the 
NMDA/AMPA ratio in Shank2~~ brain slices (Fig. 4a). Moreover, 
CDPPB restored the impaired LTP and LTD at SC-CAI1 synapses 
(Fig. 4b, c), without affecting basal synaptic transmission (Supplemen- 
tary Fig. 17). Biochemically, CDPPB treatment of Shank2-'~ mice 
fully normalized NMDAR signalling in Shank2~ whole brains and 
also in Shank2" synaptosomes (Supplementary Figs 18). The smaller 


Time (min) 


n= 6 (WT), 8 (KO). d, Impaired LTD. n = 10 (WT), 9 (KO). e, Reduced 
NMDA/AMPA ratio. n = 8 (WT), 8 (KO). *P < 0.05, **P < 0.01, 
***P < 0,001. Data represent mean + standard error. 


extent of signalling deficits in older mice (8 weeks) relative to younger 
mice (3-4 weeks) may reflect age-dependent reductions in NMDAR- 
mediated currents and/or compensatory changes in NMDAR signalling. 

Behaviourally, Shank2~'~ mice treated with CDPPB showed sub- 
stantial recoveries in social interaction to a greater extent than those 
treated with D-cycloserine, while having no effect on social novelty 
recognition (Fig. 4d-f and Supplementary Fig. 19). A lower dose of 
CDPPB did not rescue impaired social interaction (Supplementary 
Fig. 20), indicative of a dose-dependent action. Notably, CDPPB did 
not rescue impaired pup retrieval, repeated jumping, anxiety-like 
behaviours and hyperactivity (Supplementary Fig. 21), suggesting that 
CDPPB selectively rescues social interaction, but not other behaviours. 
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Figure 3 | p- cycloserine 1 normalizes NMDAR function and improves social 
interaction in Shank2~’~ mice. a, D-cycloserine (20 |1M) recovers the reduced 
NMDA/AMPA ratio. n = 13 (wild type, vehicle (WT-V)), 9 (knockout, vehicle 
(KO-V)), 9 (knockout, D-cycloserine (KO-D)). b-d, D-cycloserine 


5 Distance moved (m) 5 


(20 mgkg™) partially normalizes the impaired three-chamber social 
interaction in Shank2-/~ mice. n = 9 (WT-V), 10 (WT-D), 11 (KO-V), 10 
(KO-D). *P < 0.05, **P < 0.01, ***P < 0.001, NS, not significant. Data 
represent mean + standard error. 
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Figure 4 | CDPPB normalizes NMDAR function and substantially 
improves social interaction in Shank2~/~ mice. a, CDPPB (10 iM) restores 
the NMDA/AMPA ratio at Shank2 ‘~ SC-CA1 synapses. n = 13 (wild type, 
vehicle (WT-V)), 8 (wild type, CDPPB (WT-C)), 9 (knockout, vehicle (KO-V)), 
9 (knockout, CDPPB (KO-C)). b, c, CDPPB (10 tM) recovers impaired LTP 
and LTD. n = 5 (WT-V), 5 (WT-C), 5 (KO-V), 6 (KO-C) for high-frequency 


These results, together with the D-cycloserine results, suggest that 
reduced NMDAR function and signalling lead to impaired social inter- 
action in Shank2~ mice, although NMDAR-independent mechan- 
isms may also have a role. 

Recently, another line of Shank2" mice produced by deleting exon 7 
has been reported to display multiple phenotypes, including reduced 
spine number, reduced basal transmission, elevated NMDAR currents 
and ASD-like behavioural changes*’. Given that our Shank2~ mice 
lack both exons 6 and 7, the observed differences in mouse phenotype 
might reflect the differences in genetic deletions and are in line with the 
different ASD symptoms observed in humans’. In addition, the fact 
that both reduced and enhanced NMDAR functions lead to the same 
ASD-like phenotypes in mice suggest that maintaining normal levels of 
NMDAR function is important. 

We have demonstrated that NMDAR function is an important 
mechanism underlying the development and rescue of ASD-like 
phenotypes in Shank2-‘~ mice, and that mGluR5 may be a novel 
target for the treatment of ASD involving altered NMDAR function. 


METHODS SUMMARY 


Animals and statistical analysis. Numbers, genders and ages of mice used for 
behavioural and other assays are summarized in Supplementary Table 1. All 
behavioural and electrophysiological assays were performed and analysed in a 
blind manner. Statistical analyses were performed using SPSS 12.0 (SPSS) and 
OriginPro (OriginLab), and details of the results are described in Supplementary 
Table 2. 

Experimental details of mouse generation and characterization by behavioural, 
electrophysiological, biochemical and immunohistochemical analyses are 
described in Supplementary Methods. 
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The deubiquitinase USP9X suppresses pancreatic 


ductal adenocarcinoma 
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Pancreatic ductal adenocarcinoma (PDA) remains a lethal malig- 
nancy despite much progress concerning its molecular character- 
ization. PDA tumours harbour four signature somatic mutations’ * 
in addition to numerous lower frequency genetic events of uncer- 
tain significance’. Here we use Sleeping Beauty (SB) transposon- 
mediated insertional mutagenesis® in a mouse model of pancreatic 
ductal preneoplasia* to identify genes that cooperate with oncogenic 
Kras®'?? to accelerate tumorigenesis and promote progression. 
Our screen revealed new candidate genes for PDA and confirmed 
the importance of many genes and pathways previously implicated 
in human PDA. The most commonly mutated gene was the 
X-linked deubiquitinase Usp9x, which was inactivated in over 
50% of the tumours. Although previous work had attributed a 
pro-survival role to USP9X in human neoplasia’, we found instead 
that loss of Usp9x enhances transformation and protects pancreatic 
cancer cells from anoikis. Clinically, low USP9X protein and 
messenger RNA expression in PDA correlates with poor survival 
after surgery, and USP9X levels are inversely associated with 
metastatic burden in advanced disease. Furthermore, chromatin 
modulation with trichostatin A or 5-aza-2’-deoxycytidine elevates 
USP9X expression in human PDA cell lines, indicating a clinical 
approach for certain patients. The conditional deletion of Usp9x 
cooperated with Kras@’”” to accelerate pancreatic tumorigenesis in 
mice, validating their genetic interaction. We propose that USP9X 
is a major tumour suppressor gene with prognostic and therapeutic 
relevance in PDA. 

The biological sequelae of PDA has been partially attributed to 
frequent and well characterized mutations in KRAS (>90%), 
CDKN2A (>90%), TP53 (70%) and SMAD4 (55%)'~*. Recent genome- 
wide analyses have uncovered numerous additional somatic genetic 
alterations, although the functional relevance of most remains uncertain®. 
To explore the molecular genesis of PDA we previously generated a 
mouse model of pancreatic intraepithelial neoplasia (mPanIN) by con- 
ditionally expressing an endogenous Kras@'”” allele in the developing 
pancreas®. Mice with mPanIN spontaneously progress to mouse PDA 


(mPDA) after a long and variable latency, providing an opportunity to 
characterize genes that cooperate with Kras@’’? to promote early 
mPDA. We hypothesized that such genes could be directly identified 
by applying insertional mutagenesis strategies®”'°'' in our mPanIN 
model, and that these candidates could represent ‘drivers’ of PDA 
development. 

Accordingly, we interbred our mPanIN model with two distinct SB 
transposon systems and monitored mice for early disease progression. 
Our initial approach used the well characterized CAGGS-SB10 trans- 
genic allele to promote transposition’. Although CAGGS-SB10 
promoted PDA, a variety of non-pancreatic neoplasms and a paucity 
of identified common insertion sites (CIS) in the recovered pancreatic 
neoplasms precluded a comprehensive analysis, potentially reflecting 
the variegated expression of CAGGS-SB10 (ref. 12) (Supplementary 
Figs la and 2, and Supplementary Tables 1 and 3b). 

To increase the specificity and potency of SB mutagenesis, we 
generated a conditional SB13 mutant mouse by targeting the Rosa26 
locus in embryonic stem cells (Supplementary Fig. 3a, b). The 
pancreas-specific expression and function of the conditional SB13 
allele was confirmed (Supplementary Fig. 3c), and we found that 
SB13-induced transposition by itself did not promote lethality or 
pancreatic tumorigenesis (Fig. la and Supplementary Fig. 4a). In 
contrast, Kras'S'-612P. ; Pdxl1-cre; T2/Onc; Rosa26-LSL-SB13 mice 
rapidly progressed and succumbed to invasive pancreatic neoplasms 
(Fig. la-c). A cohort of 117 Kras'®!-G!2P. Pdx1-cre; T2/Onc; Rosa26- 
LSL-SB13 mice (Supplementary Fig. 1b) was monitored for tumour 
development, and 103 of these mice were available for full necropsy 
and tissue procurement. The majority of such mice harboured 
multi-focal pancreatic tumours, and 198 distinct primary tumours 
and metastases were subjected to histological and molecular analysis. 
Most mice had invasive carcinomas (66 of 103) that consisted of 
classical mPDA (78.8%) or invasive cystic neoplasms (21.2%); 34.8% 
of mice also contained metastases predominantly in their liver and 
lungs (Supplementary Fig. 4c). The remainder of the mice (37 of 
103) had pre-invasive pancreatic tumours consisting of high-grade 
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Figure 1 | Transposon mutagenesis accelerates murine PDA and targets 
Usp9x. a, Increased mortality of Kras’"©?. Pdx1-cre; T2/Onc; Rosa26-LSL- 
SB13 (KCTSB13) mice compared to the KC conan (containing KrasSiGi2b. 
Pdx1-cre; T2/Onc, Kras'®“-©!?»; Pdx1-cre; Rosa26-LSL-SB13 and Kras‘*“” Gib, 
Pdx1-cre mice) (172 versus 257 ‘days, P<0.001; long-rank test). The wild- type 
(WT) cohort comprises Kras'®¥-©!?°, T2/Onc; Rosa26-LSL-SB13 and Pdx1-cre; 
T2/Onc; Rosa26-LSL-SB13 mice. b, c, Invasive cystic neoplasm (b) and mPDA 
(c) in Kras'S9!; Pdx1-cre; T2/Onc; Rosa26-LSL-SB13 mice. Scale bar: 


mPanIN and cyst-forming papillary neoplasms (Supplementary 
Fig. 4b). 

The candidate genes identified from the SB13 screen represented 
unanticipated candidates as well as many genes and pathways previ- 
ously implicated in human PDA (Table 1 and Supplementary Tables 2, 
3a and 4). Indeed, various members of the TGF-B pathway, including 
Smad3, Smad4, Tgfbr1 and Tgfbr2, were collectively mutated in 32% of 


Table 1 | Top 20 candidate CIS genes that cooperate with Kras@17° to 
promote mPDA in Kras-*-¢779; Pdx1-cre; T2/Onc; Rosa26-LSL-SB13 


mice 
Gene Chr ClSpeaklocation ClSheight n | Mutation in humans 
Usp9x xX 12691773 158.1266 101 341 eo 
Pten 19 32872602 64.5204 61 96 = 
Fndc3b 3 27562591 13.7096 55 67 = 
Setd5 6 113057997 35.6176 52 71 - 
Arfip1/Fbxw7 3 84769635 21.6666 48 80 Yes (ref. 5) 
Fam193a 5 34705809 243555 45 78 a 
Ctnnal 8 35342868 20.2017 45 50 aa 
Magil 6 93859940 3.3715 43 57 = 
Mkin1 6 31414109 6.5263 41 53 - 
Pum1 4 130288478 2.7948 41 46 Yes (ref. 5) 
Farp1 4 121587858 9.407 39 47 = 
Foxp1 6 98921646 9.5831 38 60 - 
Aridla 4 133268936 32.1628 38 47 Yes (ref. 5) 
Acvr1b 5 101024934 31.1752 38 47 Yes (ref. 15) 
Map4k3 a 81109860 3.2385 38 45 Yes (ref. 5) 
Stag2 Xx 39535994 6.8613 37 48 = 
MII5 5 22982314 6.0001 37 43 Yes (ref. 5) 
Atxn2/Sh2b3 5 122267680 2.3174 37 41 a 
Arhgap5 2 53644560 37.416 35 61 = 
Gsk3b 6 38106972 21,79 35 43 - 


ClSs were ranked by tumour frequency where the spatial distribution of insertion sites was analysed using 
the narrowest 15K kernel scale. Chr, chromosome; CIS height, estimate of the number of insertions within a 
specific genomic region as a result of summing the kernel functions present in that region; /, total number of 
insertions of the CIS in the indicated tumours; n, number of tumours from which the CIS was found. 


12,750,000 


100 ptm. d, Usp9x is the major CIS in Kras'$“-@!?. Pdx1-cre; T2/Onc; Rosa26- 
LSL-SB13 PDA tumours (x axis denotes genome, y axis -log P value), with 
bidirectional insertions. (+) indicates parallel to Usp9x expression; (—) 
indicates antiparallel. e, Usp9x exon 2-T2/Onc chimaeric mRNA in SB13 
tumours. f, g, Usp9x protein expression in normal pancreatic ducts (arrow) 
(f), but not in neoplastic cells (arrows) (g), in SB13 PDA harbouring Usp9x 
insertions. Scale bar: 100 um. 


the tumours. Also, the Rb-p16Ink4a pathway was disrupted in 21% of 
the tumours. CISs representing the orthologues of additional human 
PDA genes included Fbxw7 (24.2%), Aridla (19.1%), Acvr1b (19.1%), 
Stk11 (also called Lkb1) (6.5%), MII3 (6%), Smarca4 (6%) and Pbrm1 
(4.5%)°"*"*. Trp53 was the only commonly mutated PDA gene con- 
spicuously absent, although the p53 regulatory deubiquitinase Usp7 
was a CIS (6.5%)'®. Several CISs previously noted in insertional muta- 
genesis screens for hepatocellular carcinoma or gastrointestinal tract 
adenomas, but not typically mutated in PDA, were also identified in 
this study, including Zbtb20, Nfib and Ube2h in liver tumours"°, and 
Pten, Tcf12, Ppp1r12a and Ankrd11 in gastrointestinal tract adenoma/ 
adenocarcinoma". This indicates that many tumour progression 
pathways may be common to pancreatic, liver and gastrointestinal/ 
colorectal tumours. 

Unexpectedly, the most frequent CIS observed was the X-linked 
deubiquitinase Usp9x, a gene that had not been previously associated 
with PDA or other types of carcinoma in humans or mouse models. 
Indeed, the COSMIC data base revealed only one USP9X mutation ina 
case of ovarian cancer, although the functional relevance of this muta- 
tion has not been characterized (COSMIC mutation ID 73237). Usp9x 
was disrupted in over 50% of all tumours, with 341 insertions noted in 
the 101 tumours harbouring this CIS (Fig. ld and Table 1). 
Furthermore, Usp9x was also identified as a CIS in four samples from 
the initial SB10 screen (Supplementary Table 1), supporting its can- 
didacy as a PDA genetic determinant. We confirmed that Usp9x was 
disrupted in tumours by isolating chimaeric fusion mRNAs that 
spliced the Usp9x transcript to the T2/Onc transposon (Fig. le). In 
addition, the Usp9x protein was specifically absent in neoplastic cells 
in pancreatic tumours bearing intragenic insertions (Fig. 1f, g). 

To characterize the cellular and molecular pathways affected by 
Usp9x in PDA, we used RNA interference to deplete Usp9x levels in 
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mPDA cell lines (Supplementary Fig. 5a). Although Usp9x depletion 
did not affect the proliferation of monolayer cultures (Supplementary 
Fig. 5b), it significantly increased colony formation in soft agar (Fig. 2a, 
Supplementary Fig. 5c) compared to cells transfected with scrambled 
short hairpin RNAs. Furthermore, knock down of Usp9x potently 
suppressed anoikis in mPDA cells (Fig. 2b). These properties of 
Usp9x were predominantly dependent on its intrinsic deubiquitinase 
activity (Supplementary Fig. 6a, b). 

Because USP9X was previously reported to positively regulate 
SMAD4 transcriptional activity'’ and SMAD4 is commonly mutated 
in PDA‘, we hypothesized that Usp9x loss would attenuate Smad4 
function or TGF-B responsiveness in PDA cell lines. However, irre- 
spective of Usp9x expression level, mPDA cell lines expressed Smad4 
and were equally sensitive to p21 induction, growth inhibition and 
morphological alterations after exposure to TGF-$1 (Supplementary 
Fig. 7). Therefore, we were unable to ascribe a specific role to Usp9x in 
the regulation of the Smad4-TGF-B pathway in mPDA cells or 
tumours. 

We next investigated several additional proteins reported to be 
regulated by Usp9x and involved in pathways relevant to cellular 
transformation. Although USP9X has been shown to bind to and 
regulate two proteins involved in cell survival, ASK1 (ref. 18) and 
MCLI (refs 9, 19), we could not detect obvious changes in Askl or 
Mcll protein levels upon Usp9x loss (Fig. 2c). Usp9x has also been 
reported to deubiquitinate and thereby stabilize the E3 ligase Itch”®; 
decreased protein levels of Itch were observed in mouse and human 
PDA cells upon the depletion of Usp9x (Fig. 2c and Supplementary 
Fig. 8a). Notably, ectopic Itch expression was sufficient to promote 
anoikis in mPDA cells (Fig. 2d), and Itch was partially responsible for 
the ability of Usp9x to promote anoikis and suppress colony formation 
(Supplementary Fig. 6c, d). Because Itch is known to promote the 
degradation of several proteins relevant to cell proliferation and 
survival’', we evaluated the protein expression of likely candidates 
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Figure 2 | Usp9x regulates PDA cellular transformation and Itch. 

a, b, Usp9x knock down promotes anchorage-independent growth in three 
mPDA cell lines (a), and decreases anoikis denoted by cleaved caspase 3 (CC3) 
(b). The mean and s.e.m. of one representative experiment performed in 
triplicate are shown in a (***P < 0.001; Mann-Whitney U-test). S, scramble; 
U, Usp9x. c, Usp9x knock down decreases Itch levels but not Ask1 or Mcll 
levels. Changes in Itch are more evident in suspension cultures, and the slower 
migrating band has the expected mobility of mono-ubiquitinated Itch. 

d, Ectopic Itch induces anoikis. B, pBabe-neo; I, pBabe-neo-Myc-Itch. 
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including c-Jun, p63 and c-Flip but observed no alterations (Sup- 
plementary Fig. 8b). Furthermore, the Itch gene was identified as a 
CIS in 13% of cases (Supplementary Table 2). Therefore, Usp9x 
mutation may promote tumorigenesis in part by disabling Itch func- 
tion, and the Usp9x-Itch pathway may work to constrain pancreatic 
tumorigenesis. 

To determine whether USP9X expression is aberrant in human 
PDA, three distinct patient cohorts were assessed. First, we analysed 
a cohort of 100 Australian patients who underwent surgery for loca- 
lized PDA and had detailed information available concerning clinical- 
pathological characteristics and outcome (Supplementary Fig. 9 and 
Supplementary Tables 5 and 6). Tumour DNA from 88 patients in this 
cohort failed to yield somatic mutations in USP9X, consistent with pre- 
vious reports’ (data not shown). Notably, the low expression of USP9X 
mRNA correlated with poor survival after surgery (P = 0.0076) (Fig. 3a), 
and multivariate analysis revealed that USP9X expression was an inde- 
pendent poor prognostic factor after surgery (Supplementary Table 7). 
We next analysed autopsy specimens from a separate cohort of 42 
American patients to determine that USP9X protein expression inversely 
correlated with a widespread metastatic pattern (P = 0.0212) (Fig. 3b), 
and bore no relation to SMAD4 expression (Supplementary Table 8). A 
third collection of PDA specimens obtained from resected German 
patients (n = 404) was used to determine that USP9X and ITCH 
protein levels were decreased (Supplementary Fig. 10) and expressed 
in a similar manner (Spearman-Rho correlation = 0.47; P< 0.01; Sup- 
plementary Table 9a) in tumours compared to normal pancreatic 
tissue. Furthermore, the proportion of tumours that had undetectable 
USP9X (13.6%) or ITCH (30.5%) protein correlated with a worse 
outcome (Supplementary Fig. 11, Supplementary Table 9b, c), particu- 
larly regarding USP9X in the subset of high-grade tumours (Fig. 3c 
and Supplementary Tables 10 and 11). Collectively, these findings 
implicate the loss of USP9X expression as a relevant event in human 
pancreatic cancer progression. 

We found that USP9X was expressed throughout murine and 
human tumour development and lost focally in PDAs (Supplemen- 
tary Figs 12 and 13). Additionally, human PDA cell lines expressed 
lower levels of USP9X compared to non-PDA cancer cell lines 
(Supplementary Fig. 14). To investigate additional potential mechan- 
isms of USP9X regulation in PDA, human cell lines were treated with 
the DNA methylase inhibitor 5-aza-2'-deoxycytidine and the HDAC 
inhibitor trichostatin A. Both inhibitors modestly increased the 
USP9X mRNA and protein levels in most cell lines, indicating that 
USP9X may be epigenetically silenced in vivo (Fig. 3d and Supplemen- 
tary Fig. 15). Furthermore, although the promoter region of USP9X 
was not heavily methylated in tumour samples or PDA cells harbour- 
ing low protein expression (data not shown), treatment with 5-aza-2'- 
deoxycytidine did decrease colony formation of human PDA cells and 
this was partially reversed by concomitantly knocking down USP9X 
(Supplementary Fig. 16). 

To confirm that Kras°!”? cooperated with Usp9x loss to promote 
pancreatic cancer, a conditional Usp9x" allele was generated (Sup- 
plementary Fig. 17a) and interbred with Kras’*’"°'7?; Pdx1-cre mice 
to evaluate the impact on mPanIN progression. The mosaic expression 
of Usp9x in pancreas from Pdx1-cre; Usp9x” mice was confirmed by 
immunohistochemistry (Supplementary Fig. 17b). We found that all 
hemizygous male mice and heterozygous female mice carriers of the 
Usp9x" allele in the background of Kras’*’°'??; Pdxl-cre rapidly 
developed advanced mPanIN and microinvasive neoplasms within 
3 months of age (Fig. 3e, f and Supplementary Fig. 18). Immuno- 
histochemical analysis of mPanINs from heterozygous female mice 
demonstrated absence of Usp9x expression in the pre-neoplastic 
and neoplastic cells (Supplementary Fig. 18), indicative of additional 
events such as X inactivation of the other locus in female mice’. 
mPanINs in Krast8!-G12D. Pdx1-cre; Usp 9x" mice expressed intra- 
nuclear Smad4, similar to Kras’S“"°!7?; Pdxl-cre mice (Supplemen- 
tary Fig. 19a). Additionally, early passage pancreatic cell cultures 
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Figure 3 | USP9X loss promotes PDA. a-c, Decreased USP9X expression 

correlates with shortened survival in an Australian post-surgical cohort (a) (8.7 
versus 18.4 months, P = 0.0076; log-rank test), increased metastatic burden in 
an American autopsy series (b) (54% versus 19%, P = 0.0212; Fisher’s exact 

test), and diminished survival in a German post-surgical cohort (c) (11.1 versus 
16.1 months, P = 0.037; log-rank test). d, Trichostatin A (TSA, red) and 5-aza- 
2'-deoxycytidine (AZA, blue) modestly increase USP9X mRNA expression ina 
panel of eight human PDA cell lines. The mean and s.e.m. of one representative 


prepared from these mice confirmed the absence of the Usp9x protein 
and altered regulation of Itch (Supplementary Fig. 19b). Although 
some mice died of local or metastatic pancreatic cancer, aggressive 
oral papillomas often required the culling of young mice and demon- 
strated that Kras?’?? and Usp9x loss also cooperated to transform 
keratinocytes (Supplementary Fig. 19c). 

Although a recent report implicated USP9X as a pro-survival gene 
by stabilizing MCL1 (ref. 9), potential inhibitors of USP9X should be 
developed with caution as we find that Usp9x has tissue-specific effects 
including a tumour suppressor role in oncogenic Kras-initiated 
pancreatic carcinoma. USP9X is probably epigenetically silenced in a 
subset of PDA, thus explaining why previous DNA sequencing efforts 
have failed to identify this as a participant in carcinogenesis, and 
indicating that clinically available epigenome modulators may be 
beneficial agents in such patients. ITCH is a likely mediator of 
pancreatic tumour suppression by USP9X, and continued investiga- 
tion of the USP9X-ITCH pathway is warranted. More generally, the 
identification of Usp9x through the use of transposon mutagenesis 
reaffirms the utility of in vivo mouse cancer screens to complement 
the direct investigation of human cancer. 


METHODS SUMMARY 

Kras'"C!2> (ref. 24), Pdxl-cre (ref. 8), T2/Onc (ref. 6), CAGGS-SB10 (ref. 6) and 
Rosa26-LSL-SB13 strains were interbred to generate Kras¥-G?P. Pdxl-cre, 
Kras’S¥-6P, Pdx1-cre; T2/Onc; CAGGS-SB10 and Kras'S¥-¢, Pdx-1-cre; T2/ 
Onc; Rosa26-LSL-SB13 compound mutant mice. Non-quadruple mutant mice 


Oligometastatic Widely metastatic 
(n = 16) 


0 J USP9X negative 


(n= 26) 0 20 40 60 80 100 
; Time (months) 
At risk 
Positive 153 37 13 1 (0) 
Negative 23 3 0 


Normal 


PanIN3 


experiment performed in triplicate are shown. e, Usp9x deletion promotes 
mPanIN progression in Kras'®¥-©!??. Pdx1-cre; Usp9. ’* and Kras'S-G?P, 
Pdxl-cre; Usp9x”” (KCU) mice (P < 0.0001; Fisher’s exact test). 

f, Representative normal pancreas (Pdx1-cre; Usp9x ; CU), mPanIN1 
(Kras'S° G2. Pdx1-cre; KC), mPanIN3 (Kras'SG?2P, Pdx1-cre; Usp9x<" ’* and 
Kras'S¥G!??. Ddx1-cre; Usp 9x" ¥; KCU) and microinvasive mPDA (KCU, 
arrow, circled). Scale bar: 100 um. 


represented the comparison cohorts. Kras'“'!?? and Pdxl-cre mice were 
interbred with Usp9x" mice to generate the Kras''°!; Pdx1-cre; Usp9x"* 
and Kras’““¢. Pdx1-cre; Usp9x" Vy compound mutant mice, as well as the two 
control cohorts Pdx1-cre; Usp9x”" and Kras'S'"°!?: Pdxi-cre. Usp9x" mice were 
generated by Ozgene Pty. Ltd. Mice were maintained in compliance with the UK 
home office regulations. Splinkerette PCRs were performed as described previ- 
ously*”*. Reads from sequenced tumours were mapped to the mouse genome 
assembly NCBI m37 and merged together to identify SB insertion sites, as previ- 
ously described”. Redundant sequences, as well as insertions in the En2 gene and 
in the T2/Onc donor concatemer resident chromosome (chromosome 1), were 
removed. Mouse survival curves and cell culture experiments were analysed with 
the GraphPad prism program. The IHC histoscoring from the TMA samples and 
Kaplan-Meier survival curves were analysed with SPSS18, and the Spearman-Rho 
correlation coefficient (two-sided) between USP9X and ITCH was calculated. The 
IHC USP9X histoscore and analysis was conducted using Fisher’s exact test on 
post-mortem samples. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mouse strains. Kras“*“"°!?? (ref. 24), Pdx1-cre (ref. 8), T2/Onc (ref. 6), CAGGS- 
SB10 (ref. 6) and Rosa26-LSL-SB13 strains were interbred to generate 
Kras'SG12D ; Pdxl-cre,; T2/Onc,; CAGGS-SB10 (KCTSB10) and Kras'-G12P ' 
Pdx-1-cre; T2/Onc; Rosa26-LSL-SB13 (KCTSB13) compound mutant mice. 
Non-quadruple mutant mice represented the comparison cohorts. Genomic 
DNA from tumours developed in KCTSB10 and KCTSB13 mice was obtained 
using the Puregene Core Kit A (Qiagen) and splinkerette PCRs were performed as 
described previously*”*. For the KCU cohort, Kras’¥6? and Pdx1-cre mice 
were interbred with Usp9x" mice to generate the Kras’SG?P.. Pdx1-cre; 
Usp9, ’* and Kras’Si Gb, Pdx1-cre; Usp9x™ ” (KCU) compound mutant mice, 
as well as the two control cohorts Pdx1-cre; Usp9x¢ “% (CU) and Kras’S© 9”; Pdx1- 
cre (KC). 

Generation of Rosa26-LSL-SB13 knock-in mice. TL! ES cells”’ were electropo- 
rated with linearized pRosa26-LSL-SA-SB13-BGHpolyA targeting construct and 
correctly targeted puromycin-resistant clones were identified by Southern blot. 
Two positives clones exhibiting a normal karyotype were used to generate chimaeric 
mice by microinjection into C57BL/6 blastocysts. Germline transmission of the 
targeted allele was confirmed by Southern blot analysis of tail DNA from the agouti 
offspring. 

T2/Onc excision PCR. Genomic DNAs were obtained from Pdx1-cre; T2/Onc; 
Rosa26-LSL-SB13 and T2/Onc; Rosa26-LSL-SB13 mice and primers used to assess 
the excision of the T2/Onc concatemer in the Pdx1-cre; T2/Onc; Rosa26-LSL-SB13 
mice were 5'-TGTGCTGCAAGGCGATTA-3’ and 5’-ACCATGATTACGCC 
AAGC-3’, 

CIS analysis. For the statistical analysis, redundant sequences, as well as insertions 
in the En2 gene and in the T2/Onc donor concatemer resident chromosome 
(chromosome 1), were removed and 90,007 non-redundant insertion sites 
(Supplementary Table 3) were used to identify CISs using a Gaussian kernel 
convolution framework (GKC)**. Reads from sequenced tumours were mapped 
to the mouse genome assembly NCBI m37 and merged together to identify SB 
insertion sites, as previously described*’. An enhanced version of the framework 
was developed for SB screens to account for the local density of TA sites within the 
genome”. For example, a genomic region containing a large number of insertion 
sites but a low density of TA sites is considered to be significant and thereby 
identified as a candidate CIS. Conversely, a region with a large number of insertion 
sites but also containing a high density of TA sites is determined to be less sig- 
nificant, as the transposons have more ‘target’ sites into which they can integrate. 
Multiple kernel scales were used in the GKC framework (widths of 15K, 30K, 50K, 
75K, 120K and 240K nucleotides). CISs predicted across multiple scales and 
overlapping in their genomic locations were clustered together, such that the 
CIS with the smallest genomic ‘footprint’ was reported as the representative 
CIS. For highly significant CISs with narrow spatial distributions of insertion sites, 
the 15K kernel is typically the scale on which CISs are identified. Additional 
statistical analysis of insertion sites was performed using a Monte Carlo frame- 
work’®, CISs were compared to previously published data sets of human pancreatic 
cancer genetics*”?*°. 

Detection of Usp9x-T2/Onc fusion mRNA by RT-PCR in SB tumours. Total 
RNA was extracted from snap-frozen SB tumours using the RNeasy Mini kit 
(Qiagen), and total RNA (1 |g) was reverse transcribed into cDNA using the 
High Capacity RNA-to-cDNA kit (Applied Biosystems). RT-PCR was carried 
out with a nested PCR approach using primers of mouse Usp9x exon 1 and the 
Carp-f-actin splice acceptor sequence of the T2/Onc transposon cassette. CDNA 
was used as a template in a first round of PCR using specific primers correspond- 
ing to exon 1 of Usp9x (5’-GAGTCTGCGCTGCCGCTGCTG-3’) and Carp-p- 
actin splice acceptor sequence (5'-CATACCGGCTACGTTGCTAA-3’). The 
product of this reaction was used as a template in a second round of nested 
PCR using an internal primer in the Usp9x exon 1 (5’-GCTGCCGCTG 
CTGTTGCTGC-3’) and a second primer in the Carp-f-actin splice acceptor 
sequence (5'-ACGTTGCTAACAACCAGTGC-3’). PCR products were cloned 
into pCR 2.1-TOPO vector (Invitrogen) and positives clones sequenced. 
Plasmids, shRNAs and transfections. pSuperRetro-PURO retroviral vector 
(Oligoengine) expressed a short hairpin against mouse and human USP9X 
(5'-GATGAGGAACCTGCATTTC-3’), mouse Itch (5'-GACCTGAGAAGACG 
TTTGT-3')” and a scrambled sequence (5’-GCGCGCTTTGTAGGATTCG-3’). 
pBabe-zeo-Ecotropic receptor (ecoR) was obtained from Addgene (plasmid 
no. 10687). Myc-mltch cDNA was released from pCINeo-Myc-Itch (Addgene 
plasmid no. 11427), and was subcloned in the retroviral vector pBabe-neo 
(Addgene plasmid no. 1767). KCU1 and KCU2 cell lines were transfected 
with pEF-DEST51-mUsp9x(WT)-V5 and pEF-DEST51-mUsp9x(C1566S)-V5 
plasmids***. The plasmid pEF/GW-51/LacZ (Invitrogen) was used as control. 
Transfections were done using Lipofectamine 2000 (Invitrogen). Twenty-four 
hours later, cells were selected with 5 pg ml! blasticidin (Invitrogen). 
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Cell culture. Tumour pancreatic cancer cell lines were established from 
Kras'8"6P. Pdxl-cre (T4878 and T9394), Kras’“¢?. P48-cre (TB1572) and 
Kras'S-6P., Pdx-1-Cre; Usp9x" (KCU1 and KCU2) mice as described previ- 
ously**. Cells were subsequently cultured in DMEM (Invitrogen), supplemented 
with 10% FCS (Hyclone). The normal human pancreatic ductal cell line HPDE 
was provided by M. Tsao and cultured as described previously***. The human 
pancreatic cancer cell lines AsPC1 (CRL-1682) and BxPC3 (CRL-1687) were 
acquired from ATCC and cultured according to instructions. The other cell lines 
were obtained from Clare Hall Laboratories (CRUK). The human cell lines Pancl, 
MiaPaCa2, 818.4, Hs766T, PATU2, SUIT2, FA6 and MDA-Panc3 (PDA); CaCO2 
and SW1116 (colorectal cancer); SKBR3 (breast cancer) and A549 (lung cancer) 
were cultured in DMEM supplemented with 10% FCS. The human cell lines U937 
(histiocytic lymphoma), RAMOS (Burkitt’s lymphoma), NCI-H2179 (lung cancer) 
and ZR75-1 (breast cancer) were cultured in RPMI (Invitrogen) supplemented with 
10% FCS. Cells were treated with 1 1M trichostatin A (Sigma) for 24h or with 5 uM 
5-aza-2'-deoxycytidine (Sigma) for 96h where indicated to obtain RNA and 
protein lysates to assess USP9X expression. For anchorage-independent growth 
assay, cells were treated with 5 uM 5-aza-2'-deoxycytidine (Sigma). 

Retroviral infections. Phoenix cells were plated 24 h before transfection using the 
ProFection Mammalian Transfection System Calcium Phosphate (Promega). 
Target cells were infected with retroviruses produced in the Phoenix packaging 
cells (24 and 48h after transfection) in the presence of 81g ml ' polybrene 
(Sigma) and were selected with 2,1gml~' puromycin (Sigma) or 1mg ml’ 
G418 (Clontech). Experiments were performed using at least two independent 
cell line infected pools. Human PDA cells lines Pancl, SUIT2 and PATU2 infected 
with retroviral vectors expressed the ecotropic receptor (ecoR). 
Transformation, anoikis and EMT assays. Cell lines (1.5 X 10° cells) were plated 
in triplicate in 12-well plates and counted as indicated using a Z2 Coulter 
(Beckman). Cells were fed every other day. Anchorage-independent growth was 
assessed by colony formation in soft agar. Briefly, 15,000 cells were plated in 
duplicate in DMEM with 15% serum and 0.34% low-melting-point agarose 
(LMP, BioGene) onto 6-cm dishes coated with 0.5% LMP. Cells were fed twice 
a week and grown for 2 weeks. Colonies were counted in nine different X20 fields. 
For the anoikis assay, 10° cells per 0.5 ml were plated in 24-well ultra-low cluster 
plates (Costar) to allow them to grow in suspension for 4 days. Cells were collected, 
washed with cold PBS and protein lysates were obtained. Cell line T4878 was 
cultured in matrigel as previously described*’, plating 1,000 cells per well. Cells 
were fed every 2 days and grown for 4 days. Epithelial-to-mesenchymal transition 
(EMT) was determined by plating 10° cells per 6-well plates for 24h to allow 
attachment, followed by treatment with human TGF-B1 (5ng ml ') (RD 
Systems) for 24h. p21 induction was assessed after treatment with human TGF- 
Bl (5ng ml ') (RD Systems) for 2h. 

Real-time PCR. Total RNA from human PDA cell lines was extracted using the 
RNeasy Mini Kit (Qiagen), and total RNA (1 jig) was reverse transcribed into 
cDNA using the High Capacity RNA-to-cDNA kit (Applied Biosystems). 
Human USP9X expression was analysed by quantitative PCR (qPCR) using 
TaqMan gene expression assays Hs00245009_m1 (Applied Biosystems) on a 
7900HT Real-Time PCR system (Applied Biosystems). Gene expression was 
normalized to human ACTB expression, assessed with the gene expression assays 
Hs99999903_m1 (Applied Biosystems), and shown relative to control samples. 
Western blot analysis. Cells were washed three times in cold PBS and lysed with 
boiling lysis buffer (1% SDS, 10 mM, pH7.5 Tris, 50mM NaF, 1 mM Na3VOu,). 
Lysates were boiled for 5 min, passed through a 26 gauge needle to shear genomic 
DNA and centrifuged for 10 min at 14,000 r.p.m. Equivalent amounts of protein 
were resolved in 4-12% gradient SDS-PAGE gels (Invitrogen), transferred to 
Immobilon-P transfer membranes (Millipore), and incubated with the corres- 
ponding antibodies including anti-Askl (NB110-55482, Novus Biologicals), 
anti-Mcll (5453, Cell Signaling), anti-Usp9x (A301-351A, Bethyl), anti-CC3 
(9664, Cell Signaling), anti-Itch (611198, BD), anti-p21 (sc-6246, Santa Cruz), 
anti-Smad4 (sc-7966, Santa Cruz), anti-Myc tag (2276, Cell Signaling), anti V5 
tag (R960-25, Invitrogen), anti-c-Flip (ALX-804-127, Enzo Life Sciences), anti-c- 
Jun (9165, Cell Signaling), anti-p63 (Ab110038, Abcam), anti-o1-Tubulin (T6074, 
Sigma) and anti-Actin (sc-1616, Santa Cruz Biotechnology). Reactive bands were 
visualized with ECL plus reagent (Amersham). Relative expression was quantified 
with Image Quant TL software (GE Healthcare). 

Immunohistochemistry. Formalin-fixed paraffin-embedded (FFPE) mouse tis- 
sues were cut into 3-j1m tissue sections, and antigen retrieval was performed in 
10 mM, pH6.0 citric acid (for Usp9x and E-cadherin) or 10 mM, pH 8.0 EDTA 
(for Smad4). Endogenous peroxidases were quenched in 3% HO,/PBS for 20 min. 
Signal detection for immunohistochemistry was accomplished with biotinylated 
secondary antibodies (Vector Laboratories) using the Elite Vectastain ABC kit and 
peroxidase substrate DAB kit (Vector Laboratories). Primary antibodies used were 
anti-Usp9x, 1:200 (A301-351A, Bethyl); E-cadherin, 1:200 (610182, BD) and 
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anti-Smad4, 1:100 (sc-7966, Santa Cruz). Slides were counterstained with 
haematoxylin. 

Clinical patient samples immunohistochemistry and analysis. Tissue micro- 
arrays (n = 404) were prepared from patient samples obtained after appropriate 
informed consent in Dresden (Institute of Pathology, University Hospital 
Dresden), Regensburg (Institute of Pathology, University Hospital Regensburg) 
and Jena (Institute of Pathology, University Hospital Jena). Informed consent was 
obtained for each patient, following review by the human ethics committee 
Ethikkommission an der Technischen Universitat Dresden. The PDA tumour 
samples were collected from 1993 to 2009, and most of the patients (65%) did 
not undergo adjuvant chemotherapy. Those that did undergo adjuvant therapy 
(35%) were chiefly treated with 5FU or gemcitabine-based regimens, but in this 
subgroup there was no significant increase in patient survival. The median survival 
times of patients after surgery from each centre were indistinguishable. 
Immunohistochemistry was performed on 5 jm sections that were prepared using 
silanized slides (Menzel Glaser). Staining was performed with the Benchmark 
System (Ventana), using rabbit anti-USP9X antibody, 1:200 (A301-351A, 
Bethyl) and anti-ITCH, 1:200 (611198, BD); and the protocol UltraView HRP, 
with the CC1 modified protocol as pre-treatment. Slides were counterstained with 
haematoxylin. Staining intensities were scored as absent (0), weak (1), medium (2) 
and strong (3). For further analysis the staining intensities were grouped as nega- 
tive (0) and positive (1-3). The Cox regression model assumption of proportional 
hazard was tested using a plot of the cumulative hazards function. 

A second cohort of patient samples was obtained from the Gastrointestinal 
Cancer Rapid Medical Donation Program in the Department of Pathology at 
Johns Hopkins Hospital, USA. Use of all human tissue samples from resection 
specimens and autopsy participants was approved by Johns Hopkins Institutional 
Review Board, and obtained after informed consent. All samples were collected 
within 12h post mortem and formalin fixed before paraffin embedding. Five- 
micrometre sections were cut from matched primary and metastasis samples onto 
glass slides. Slides were first incubated in Dako Target Retrieval Solution for 
antigen retrieval. Slides were then incubated with rabbit anti- USP9X antibody, 
1:1,000 (ab26334, Abcam) or 1:200 (NBP1-48321, Novus Biologicals), and anti- 
SMAD4 as previously described**. Signal detection for immunohistochemistry 
was accomplished with Dako LSAB+System-HRP. Slides were counterstained 
with haematoxylin. 

An additional cohort of pancreatic cancer resection samples was prospectively 
acquired through the Australian Pancreatic Cancer Network and the Australian 
Pancreatic Cancer Genome Initiative (http://www.pancreaticcancer.net.au/apgi). 
Consent was obtained for genomic sequencing through the Australian Pancreatic 
Cancer Genome Initiative (APGI) for each individual patient following approval 
from Human Research Ethics Committees (HREC) at participating sites (Sydney 
South West Area Health Service HREC Western Zone, 2006/054; Sydney Local 
Health Network HREC RPA Zone, X11-0220; and North Sydney Central Coast 
Health, Harbour HREC, 0612-251M). We extracted RNA from tumour samples 
using the Qiagen Allprep kit (Qiagen) in accordance with the manufacturer's 
instructions, assayed for quality on an Agilent Bioanalyzer 2100 (Agilent 


Technologies), and subsequently hybridized to Iumina Human HT-12 V4 micro- 
arrays. Raw idat files were processed using IlluminaGeneExpressionIdatReader 
(M. Cowley et al., manuscript in preparation). After array quality control, these 
data were vs.t transformed, and then robust spline normalized, using the lumi 
R/Bioconductor package. For the ICGC/APGI cohort, we assumed a proportional 
hazard: that the probability of death is the same for those censored as for those 
remaining on study. 

For the TMA and expression array cohorts, median survival was estimated 
using the Kaplan-Meier method and the difference was tested using the log-rank 
test. P values of less than 0.05 were considered statistically significant. For the 
TMA cohort, as few parameters were significant in univariate analysis, all were 
initially considered for Cox Proportional Hazard multivariate analysis in a 
backward elimination model, and assessed with the SPSS18 Software (IBM) with 
overall survival used as the primary endpoint. For the ICGC/APGI cohort, clinico- 
pathological variables analysed with a P value of less than 0.25 on log-rank test 
were entered into Cox proportional hazard multivariate analysis and the model 
was resolved using backward elimination. Statistical analysis was performed using 
StatView 5.0 Software (Abacus Systems). Disease-specific survival was used as the 
primary endpoint. 
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Control of a Salmonella virulence locus by an 
ATP-sensing leader messenger RNA 


Eun-Jin Lee’? & Eduardo A. Groisman!? 


The facultative intracellular pathogen Salmonella enterica resides 
within a membrane-bound compartment inside macrophages’. This 
compartment must be acidified for Salmonella to survive within 
macrophages’, possibly because acidic pH promotes expression of 
Salmonella virulence proteins**. We reasoned that Salmonella 
might sense its surroundings have turned acidic not only upon 
protonation of the extracytoplasmic domain of a protein sensor® 
but also by an increase in cytosolic ATP levels, because conditions 
that enhance the proton gradient across the bacterial inner 
membrane stimulate ATP synthesis®’. Here we report that an 
increase in cytosolic ATP promotes transcription of the coding 
region for the virulence gene mgtC, which is the most highly induced 
horizontally acquired gene when Salmonella is inside macrophages*. 
This transcript is induced both upon media acidification and by 
physiological conditions that increase ATP levels independently of 
acidification. ATP is sensed by the coupling/uncoupling of tran- 
scription of the unusually long mgtC leader messenger RNA and 
translation of a short open reading frame located in this region. A 
mutation in the mgtC leader messenger RNA that eliminates the 
response to ATP hinders mgtC expression inside macrophages 
and attenuates Salmonella virulence in mice. Our results define a 
singular example of an ATP-sensing leader messenger RNA. 
Moreover, they indicate that pathogens can interpret extracellular 
cues by the impact they have on cellular metabolites. 

The mgtC gene is required for survival inside macrophages and 
for growth in low Mg** media in several phylogenetically unrelated 
intracellular pathogens including Salmonella enterica®, Yersinia 
pestis'®, Brucella suis'', Burkholderia cenocepacia’* and Mycobacterium 
tuberculosis'*. In Salmonella, mgtC heads the mgtCBR operon, which 
specifies the inner membrane protein MgtC, the Mg” transporter 
MgtB and the MgtR peptide promoting MgtC degradation”. 
Transcription from the mgtCBR operon is controlled by the virulence 
regulatory system PhoP/PhoQ”*. Expression of mgtC must be tightly 
regulated for a normal course of infection because inactivation of the 
mgtC gene attenuates Salmonella virulence in mice’, whereas prevent- 
ing transcription of AmgR, a PhoP-dependent anti-sense RNA that 
promotes the preferential degradation of the mgtC portion of the poly- 
cistronic mgtCBR message, renders Salmonella hypervirulent’’. 

Mild acidic pH induces expression of the mgtC’’ (Supplementary 
Fig. 1) and mgtB'® genes in a phoP-dependent manner in wild-type 
Salmonella. This could be ascribed to acidic pH sensing by PhoQ, the 
cognate sensor of PhoP®. However, Salmonella seems to use a different 
mechanism to promote mgtC expression in response to mild acidic 
pH: when bacteria were switched from media at pH 7.7 to media at pH 
5.1, the messenger RNA (mRNA) levels of the mgtC (Fig. 1a) and mgtB 
(Fig. 1b) coding regions increased even ina phoP* phoQ’ strain, which 
lacks the pH sensor PhoQ and harbours a PhoP allele that can function 
in its absence’’. This stimulation seems to be specific to mgtC because 
the mRNA levels corresponding to the mgtA coding region did not 
increase (Fig. 1c), despite the mgtA promoter also being under direct 
transcriptional control of PhoP”. 
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Figure 1 | Mild acidic pH promotes transcription of the mgtC and mgtB 
coding regions in a Salmonella strain lacking the extracytoplasmic pH 
sensor PhoQ. Relative mRNA levels of the coding regions of the mgtC 

(a), mgtB (b) and mgtA (c) genes produced by a Salmonella phoP* phoQ strain 
(EG10232) or a derivative with conserved adenine nucleotides at position 44- 
46 in the mgtCBR leader substituted by thymine nucleotides (EL486). Bacteria 
were grown in N-minimal media with 500 1M Mg’~ at pH 7.7 for 1 h and then 
for an additional 1 h in the same media at pH 5.1 in the absence or presence of 
0.5 tM of the protonophore carbonyl cyanide 3-chlorophenylhydrazone 
(CCCP). mRNA levels of target genes were normalized to that of 16S ribosomal 
RNA rrs gene. The mean and s.d. from two independent experiments are 
shown. d, Intracellular ATP levels and pH of Salmonella (EG10232) grown in 
media with pH 7.7 and 1h after being switched to media with pH 5.1 to an 
attenuance at 600 nm (Deoonm) of 0.453. Values are in picomoles of ATP per 
millilitre of cells per Déoo nm. The mean and s.d. from two independent 
measurements are shown. 
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The stimulation of mgtC expression promoted by mild acidic pH 
could be mediated by an increase in the proton gradient across the 
inner membrane, hence creating a change in cytosolic ATP. Indeed, 
the intracellular pH dropped by ~0.9 units and the ATP concentration 
rose ~2.5-fold within 1h of switching the phoP* phoQ strain from 
media at pH 7.7 to media at pH 5.1 (Fig. 1d). Dissipation of the proton 
gradient with a protonophore decreased ATP levels (Fig. 1d) and pre- 
vented induction of mgtC and mgtB at pH 5.1 (Fig. 1a, b). (A similar 
drop in intracellular pH was shown by wild-type Salmonella; that is, 
from pH 8.15 + 0.112 in media with pH 7.7 to pH 7.25 + 0.079 in 
media with pH 5.1 and to pH 5.04 + 0.031 in media with pH 5.1 in the 
presence of the protonophore.) 

That cellular ATP is the signal upregulating mgtC transcription when 
Salmonella experiences mild acidification is supported by two additional 
experiments performed with strains carrying plasmid pGFP303, which 
harbours the nucleotide sequence corresponding to the natural mgtCBR 
promoter and leader region fused to a promoterless gfp gene at the mgtC 
start codon. Fluorescence was sixfold higher when a purine auxotroph 
was grown in defined media with high adenine than with low adenine 
(Fig. 2a). The higher fluorescence is not due to differences in growth 
rates (data not shown) or a decrease in cellular pH (Fig. 2b), but reflects 
the higher ATP levels present in bacteria grown at the higher adenine 
concentration (Fig. 2b). Moreover, it is specific to ATP because a change 
in uridine levels in the growth media did not alter the fluorescence 
of a uridine auxotroph harbouring pGFP303 (data not shown). 
Furthermore, it requires mgtCBR regulatory sequences because changes 
in the adenine concentration in the media did not affect the fluor- 
escence produced by the adenine auxotroph harbouring the plasmid 


vector (Fig. 2a). And when wild-type Salmonella harbouring pGFP303 
was grown in glucose, fluorescence was two times higher than when it 
was grown in glycerol (Fig. 2c), reflecting the larger ATP amounts 
generated when glucose is used as carbon source (Fig. 2d). 
Weanalysed the mgtCBR leader mRNA seeking sequence and struc- 
tural features that might indicate how it might sense ATP. We iden- 
tified two short open reading frames (ORFs)—designated mgtM and 
mgtP—preceded by putative ribosome-binding sites (Fig. 3a). Here, we 
focus on mgtM because, as subsequently described, it mediates the 
response to ATP. We determined that mgtM is translated in vivo 
(Supplementary Fig. 2) and that similarly sized ORFs preceded by 
potential ribosome-binding sites are conserved in the mgtCBR leader 
regions from other enteric bacteria (Supplementary Fig. 3). The 
sequences adjacent to and including mgtM have the potential to adopt 
two alternative structures—stem-loops A and B—in Salmonella 
(Fig. 3a) and other examined species (Supplementary Fig. 3). In-line 
probing experiments verified the formation of stem-loop B in the wild- 
type mgtCBR leader RNA (Supplementary Fig. 4) and of stem-loop A 
in a mutant leader RNA with the Gy;—>C substitution, which is pre- 
dicted to hinder formation of stem-loop B (Supplementary Fig. 4). 
The deduced amino-acid sequence of mgtM is not conserved in other 
species (Supplementary Fig. 5), but its length and location relative to 
stem-loop A, as well as the presence of adenine nucleotides near the 
base and within stem-loop A, are (that is, positions 44-46 and 56-57; 
Fig. 3b). Because the left arm of stem-loop A includes the last four 
mgtM codons (Fig. 3a), translation of the complete mgtM is anticipated 
to hinder formation of stem-loop A and favour formation of stem- 
loop B. Therefore, changes in intracellular ATP levels might affect the 
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Figure 2 | ATP promotes gene transcription in a manner dependent on 
conserved adenine nucleotides in the mgtCBR leader region. a, Fold change 
in fluorescence produced by an adenine auxotroph (EG9652) harbouring the 
plasmid vector pfpv25, pGFP303 or derivatives with substitutions of conserved 
adenine nucleotides at position 44-46 by thymine nucleotides (pGFP303 A4a- 
4e6—>T), guanine nucleotides (pGFP303 Ay446—G) or cytosine nucleotides 
(pGFP303 A44 46—C), with substitutions of conserved 55-56 adenine 
nucleotides by guanine nucleotides (pGFP303 As¢_s7—>G), or with substitution 
of most of the mgtM sequence except for adenine nucleotides at 44-46 and 55- 
56 (pGFP303 mgtMgcramblea)- Plasmid pfpv25mgtA harbours a transcriptional 
fusion between the PhoP-dependent mgtA promoter and mgtA leader sequence 
and a promoterless gfp gene. Bacteria were grown in N-minimal media with 
10 uM Mg’* in the presence of either 25 uM (low) or 250 uM (high) adenine. 
Fluorescence was monitored as described in Methods. Fold change was 
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calculated by dividing the fluorescence of cells grown in high adenine by that of 
cells grown in low adenine. Note that the ratio of fluorescence shown at high 
and low adenine is the same for bacteria harbouring pfpv25mgtA or the vector 
pfpv25. Data correspond to a representative of four independent experiments 
conducted in duplicate. b, Intracellular ATP levels and intracellular pH of purB 
Salmonella (EG9652) grown in the presence of the indicated concentrations of 
adenine were determined by labelling with phosphorus-32 and then separating 
by thin-layer chromatography. c, Fluorescence produced by wild-type 
Salmonella (14028s) harbouring pGFP303 or pGFP303 A44 46 after growth 
for 4h in N-minimal media with 10 1M Mg*” in the presence of either 38 mM 
glycerol (~0.35%) or 0.2% glucose as a carbon source. GFP, green fluorescent 
protein. d, Intracellular ATP levels of cells grown in the indicated carbon 
sources to Deo nm = 0.419 were determined by luminescence as described in 
Methods. The mean and s.d. from two independent measurements are shown. 
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Figure 3 | Regulation of the Salmonella mgtCBR virulence operon by the 
PhoP/PhoQ system and mgtCBR leader region. a, The sensor PhoQ responds 
to extracytoplasmic low Mg”, acidic pH and antimicrobial peptides by 
promoting phosphorylation of the PhoP protein, which binds to the mgtCBR 
promoter, resulting in transcription initiation. Acidic pH also produces a proton 
gradient across the inner membrane, resulting in higher intracellular ATP levels. 
Intracellular ATP levels control transcription elongation into the mgtCBR coding 
regions by affecting the coupling/uncoupling of transcription of the mgtCBR 
leader and translation of the short ORF mgtM, which affects the formation of 
alternative stem-loops A or B. The mgtM ribosome-binding site is underlined. 
Positions and sequences of stop codon mutations or nucleotide substitutions in 
the strains used in the experiments presented in c and d are indicated and denoted 
below the mgtM sequence. b, Conservation of adenine nucleotides in mgtM 
relative to stem-loop structures in the mgtC leader. Alignment of the nucleotide 
sequences corresponding to mgtM from the indicated species. Sequences in bold 
correspond to mgtM. The mgtM ribosome-binding sites are underlined. Asterisks 
correspond to nucleotides conserved in all listed species. Conserved adenine 
nucleotides are coloured in orange; sequences with a potential to adopt stem-loop 
structures are underlined. c, }-galactosidase activity (Miller units) produced by 


Salmonella with a chromosomal mgtC-lac fusion (EG9527) and isogenic 
derivatives with mutation of the start codon (EG19269) or with stop codons at 
different positions (EG19285, EG19289, EG19293 and EG19298) in mgtM. 
Bacteria were grown in N-minimal media containing low (10 LM) or high 

(10 mM) Mg”* for 4h. d, B-galactosidase activity (Miller units) produced by 
Salmonella with a chromosomal mgtC-lac fusion (EL92) and isogenic derivatives 
with Cg7—G and Gos—C substitution (EL96) or mutation of the start codon in 
mgtM (EL97) or both (EL98). Bacteria were grown as described in c. The mean 
and s.d. from three independent experiments are shown in ¢ and d. 

e, B-galactosidase activity produced by Salmonella with a chromosomal mgtC-lac 
fusion harbouring either plasmid psupF or the empty vector pUH21-2lacI* 
(EL86, EL87) and isogenic derivatives with an amber stop codon (EL90, EL91) or 
ochre stop codon (EL88, EL89) at position 39-41 of mgtM. Bacteria were grown 
as described except in the presence of ampicillin and 0.2 mM IPTG. 

f, B-galactosidase activity produced by Salmonella with a chromosomal mgtC-lac 
fusion (EG9527) or an isogenic strain with a stop codon at position 39-41 in 
mgtM (EG19289) harbouring either the vector or plasmid pmgtM. Bacteria were 
grown as described except in the presence of ampicillin and 0.1 mM IPTG. The 
mean and s.d. from two independent experiments are shown in e and f. 
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coupling/uncoupling of transcription of the mgtCBR leader with trans- 
lation of mgtM. This would determine whether stem-loop A or B 
forms, thereby dictating whether transcription continues into the 
mgtCBR coding regions. This is analogous to the mechanism by which 
cytoplasmic UTP levels control expression of the pyrimidine bio- 
synthetic gene pyrBI in Escherichia coli, except that low UTP promotes 
pyrBI transcription when RNAP pauses at a U-rich segment of the 
pyrBI leader mRNA” whereas we propose that high ATP furthers 
expression of the mgtCBR coding regions. 

In agreement with the notion that a transcription attenuation-like 
mechanism*”’ regulates transcription elongation into the mgtCBR 
coding region, a strain with a chromosomal mutation in the mgtM 
start codon and harbouring a Jac fusion in the mgtC coding region 
produced over four times more B-galactosidase than the isogenic strain 
with the wild-type mgtC leader when grown in low Mg’ * to induce the 
PhoP/PhoQ system’* (Fig. 3c, d). Likewise, strains harbouring stop 
codons at the fourth, sixth or seventh positions of mgtM (Fig. 3a) also 
derepressed mgtC-lac expression (Fig. 3c). This derepression is due to 
a defect in mgtM translation (as opposed to resulting from an effect on 
the structure of the mgtC leader mRNA): a plasmid expressing the 
amber suppressor supF restored normal mgtC-lac transcription to 
an mgtM mutant with an amber stop codon at the sixth position but 
not to an isogenic mutant with an ochre stop codon at the same 
position (Fig. 3e). As expected, the supF-expressing plasmid had no 
effect on mgtC-lac transcription in a strain harbouring the wild-type 
mgtC leader (Fig. 3e). An mgtM derivative with a stop codon at the 
ninth position failed to express mgtC-lac (Fig. 3c), possibly because 
translation of mgtM beyond the eighth codon would result in forma- 
tion of stem-loop B (Fig. 3a). Translation of mgtM exerts its regulatory 
effect on the associated mgtCBR coding regions in cis (as opposed to 
mgtM encoding a trans-acting peptide) because a plasmid expressing 
the mgtM ORF failed to restore normal mgC-lac expression to an 
mgtM stop codon mutant, behaving like the vector control (Fig. 3f); 
and it had no effect on a strain with a wild-type mgtC leader (Fig. 3f). 
Our data indicate that whenever translation stops before the ribosome 
reaches the ninth mgtM codon, stem-loop A forms, which promotes 
expression of the mgtCBR coding region; and when mgtM is translated 
beyond a certain position, stem-loop B is favoured, which reduces 
transcription of the mgtCBR coding region (Fig. 3a). As predicted, 
the pronounced mgtC-lac derepression shown by the mgtM start 
codon mutant (Fig. 3c, d) was eliminated by the simultaneous intro- 
duction of Cg7—>G and G,;-—>C substitutions (Fig. 3d), which favours 
formation of stem-loop B (Fig. 3a). 

The phenotypes described above are not an artefact resulting from 
the absence of a functional mgtC gene in the mgtC-lac strains. This is 
because mutation of the mgtM start codon derepressed mgtC levels in 
an otherwise wild-type Salmonella that experienced mild acidification 
(Supplementary Fig. 6), and because introduction of a stop codon at 
the ninth position of mgtM silenced mgtC expression in an isogenic 
strain subjected to the same conditions (Supplementary Fig. 6). 

The conserved adenine nucleotides at positions 44-46 or 56-57 of 
the mgtCBR leader (Fig. 3a) are critical for the response to ATP: on the 
one hand, the mild acid induction of the mgtC and mgtB genes 
disappeared in a strain with a chromosomal mutation at position 
44-46 in the mgtCBR leader (Fig. 1a, b). Likewise, growth in glucose 
no longer promoted higher fluorescence than growth in glycerol in 
wild-type Salmonella carrying a pGFP303 derivative with the adenine 
nucleotides at position 44-46 substituted by thymine nucleotides 
(Fig. 2c). Yet, this mutant leader retained a wild-type structure (Sup- 
plementary Fig. 7) and ability to respond to Mg** (Supplementary 
Fig. 8)”. Moreover, changes in the adenine concentration in the media 
failed to affect the fluorescence produced by the purine auxotroph with 
derivatives of pGFP303 where the adenine nucleotides at position 
44-46 were replaced by thymine nucleotides, guanine nucleotides or 
cytosine nucleotides, or with a plasmid with the adenine nucleotides at 
position 56-57 replaced by guanine nucleotides (to avoid destabilizing 
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stem-loop A, the last two nucleotides were not replaced by thymine 
nucleotides or cytosine nucleotides; Fig. 3a) (Fig. 2a). On the other 
hand, an engineered mgtCBR leader with a scrambled mgtM sequence 
that retained the adenine nucleotides at positions 44-46 and 56-57 
had a wild-type response to ATP (Fig. 2a). The RNAs from the wild- 
type and ATP-blind mutant mgtC leader had indistinguishable in vitro 
profiles when investigated at two pHs (Supplementary Fig. 7b). This 
provides further support to the notion that the increase in mgtCBR 
expression resulting from mild acidification is mediated by cytosolic 
ATP levels as opposed to protons being sensed directly by the mgtC 
leader mRNA. Cumulatively, these data provide a singular example of 
a bacterial mRNA leader that senses ATP, using a mechanism that is 
different from those previously described in eukaryotic organisms***>. 

We wondered whether ATP sensing by the mgtCBR mRNA leader is 
required for Salmonella virulence given that mgtC is expressed in a 
variety of tissues during infection in several different animal hosts**°** 
and that inactivation of the mgtC gene hinders survival inside macro- 
phages and virulence in mice’. We established that when Salmonella 
was inside the macrophage-like cell line J774.A.1, the mRNA levels of the 
mgtA and mgtC leaders region rose in parallel (Supplementary Fig. 9), 
which probably reflects activation of their respective promoters during 
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Figure 4 | Expression of the mgtC coding region inside macrophages is 
dependent on its leader region’s ability to sense ATP, a property required for 
Salmonella virulence. a, Relative mRNA levels of the mgtC, mgtB and mgtA 
coding regions of wild-type Salmonella harbouring the wild-type mgtCBR 
leader (14028s) inside J774 A.1 macrophages at the indicated times after 
infection. b, The same as in a except using a mutant Salmonella with the 
chromosomal adenine nucleotides at position 44—46 of the mgtCBR leader 
substituted by thymine nucleotides (EL341). ¢, Survival of C3H/HeN mice 
inoculated intraperitoneally with ~10* colony-forming units of wild-type 
Salmonella (14028s) or ATP-sensing defective mutant (EL341) or deleted for 
both the mgtC and mgtB coding regions (EL6). 


©2012 Macmillan Publishers Limited. All rights reserved 


infection”. By contrast, the mRNA levels corresponding to the mgtC 
and mgtB coding regions increased markedly (that is, tenfold by 9h 
after internalization relative to Salmonella grown in tissue culture 
media; Fig. 4a), but no induction was observed for the coding region 
of the mgtA gene (Fig. 4a), which is not required for Salmonella 
virulence’. This might reflect differences between the mgtCBR (Fig. 3a) 
and mgtA* leaders. Indeed, the mRNA levels of the mgtA coding region 
did not increase under conditions promoting higher ATP levels (Fig. 2a) 
or mild acidification (Fig. 1c). Induction of the mgtC and mgtB coding 
regions inside macrophages requires the ability of the mgtCBR leader 
mRNA to sense ATP because it was defective in a chromosomal mutant 
with the adenine nucleotides at position 44-46 replaced by thymine 
nucleotides (Fig. 4b). This mutant was attenuated for virulence after 
intraperitoneal inoculation of mice (Fig. 4c), albeit not as much as a 
strain deleted for the mgtC and mgtB coding regions (Fig. 4c), implying 
that other signalling inputs remain functional in this mutant. 

Cumulatively, our findings demonstrate that the mgtCBR leader 
senses cytosolic ATP levels, thereby enabling differential control of the 
mgtCBR operon from that of other PhoP-activated genes. This property 
is critical for Salmonella virulence, possibly because Salmonella resides 
within a phagosome that is mildly acidic’, which is a condition that can 
generate high ATP levels in the bacterium®”. Finally, our data highlight 
how pathogens can interpret host-derived signals, such as acidic pH, by 
the changes they experience in their cellular constituents. 


METHODS SUMMARY 


Bacterial strains and plasmids used in this study are listed in Supplementary Table 1. 
AILS. enterica serovar Typhimurium strains were derived from the wild-type strain 
14028s and were constructed by phage P22-mediated transductions as described. 
All DNA oligonucletides are listed in Supplementary Table 2. Detailed descriptions 
of examining the effect of pH or ATP on gene expression, determining intracellular 
ATP levels, examining expression inside macrophage and mouse virulence assays 
are in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Bacterial strains, plasmids, oligodeoxynucleotides and growth conditions. 
Bacterial strains and plasmids used in this study are listed in Supplementary 
Table 1. All S. enterica serovar Typhimurium strains were derived from the 
wild-type strain 14028s*' and constructed by phage P22-mediated transductions 
as described**. All DNA oligonucletides are listed in Supplementary Table 2. 
Bacteria were grown at 37°C in Luria-Bertani broth, N-minimal media (pH 
7.7)° supplemented with 0.1% casamino acids, 38 mM glycerol and the indicated 
concentrations of MgCl). To examine the effect of pH or carbon source on gene 
expression, we used a modified N-minimal medium containing 0.2% glucose 
instead of 38 mM glycerol. E. coli DH5« was used as the host for preparation of 
plasmid DNA. Ampicillin was used at 50 pg ml, chloramphenicol at 20 pg ml, 
kanamycin at 20 jig ml’, tetracycline at 10 pg ml‘ and fusaric acid at 12 pg ml. 
Effect of pH on gene expression. Cells were grown overnight in N-minimal 
medium containing 10 mM Mg”*. Dilution (1/100) of the overnight culture was 
used to inoculate 20 ml of the same medium and grown for 3h. Cells were then 
washed and transferred to 20 ml of N-minimal medium containing 500 1M Mg** 
and grown for 1 h. The cells were collected and washed with N-minimal medium 
containing 500 uM Mg’* and an aliquot corresponding to 1/10 volume of cells for 
pH 7.7 before switching to pH 5.1. Then, cells were resuspended in 20 ml of 
N-minimal medium containing 500 1M Mg*~ at pH 5.1 with or without 0.5 uM 
carbonyl cyanide 3-chlorophenylhydrazone and growth continued for 1h. 
Bacteria were stabilized using RNAprotect Bacteria Reagent (Qiagen) and RNA 
was isolated for further analysis. 

Effect of exogenous adenine on gene expression. The adenine auxotrophic 
strain EG9652 harbouring plasmid pGFP303, its derivatives, pfpv25mgtA or the 
plasmid vector were grown overnight in N-minimal medium containing 10 mM 
Mg’*, 500 [1M adenine and ampicillin. One millilitre of the overnight culture was 
washed in the N-minimal medium without Mg”* and adenine and resuspended in 
1 ml of the same media. The suspended bacteria were inoculated 1/50 volume in 
N-minimal medium with 10 1M Mg** and ampicillin in the presence of either 25 
or 250 uM adenine. Fluorescence and Dgoo nm of the cultures were monitored for 
6.5h with shaking 37°C in a Victor’ plate reader (Perkin Elmer). To prevent 
evaporation, the 96-well plate was covered with mineral oil. GFP expression of a 
given strain was determined by plotting fluorescence over D¢oo nm- Note that the 
purB strains were freshly transduced before each experiment to prevent the isola- 
tion of suppressor mutations. 

Measurement of intracellular ATP in Salmonella. We measured intracellular 
ATP levels using a luminometer (BioTek Synergy H1) as described with modifica- 
tion”. Briefly, bacteria were grown overnight in N-minimal media containing 10 mM 
Mg** and 0.2% glucose as a carbon source. One millilitre of the overnight culture was 
washed in the N-minimal medium without Mg”* and resuspended in 1 ml of the 
same media. Diluted (1/100) bacteria were inoculated in 2 ml of N-minimal media 
containing 10 1M Mg** (with proper antibiotics if necessary) and grown for 4h. Cells 
were normalized by D¢oo nm and resuspended in 500 pl of phosphate-buffered saline 
(PBS). Nucleic acids were extracted by adding 100 pl of ice-cold 3 M perchloric acid. 
After incubating for 5 min, extracts were neutralized with 225 pl of neutralization 
buffer (1 M KOH, 0.5 M Tris, 0.5 M KCl) and centrifuged for 10 min. Fifty microlitres 
of the supernatant were diluted with 50 pil of L buffer (25 mM KCl, 50 mM MgSO,, 
100mM HEPES pH 7.4) and intracellular ATP was measured using an ATP 
Determination Kit (Invitrogen) according to the manufacturer’s instruction. 
Intracellular ATP levels (picomoles per millilitre of cells at given Dgoo nm) Were con- 
verted using reference to standards of known concentration. 

To measure intracellular ATP levels in response to exogenous adenine levels 
(Fig. 2b), purB Salmonella were grown overnight in N-minimal medium contain- 
ing 10 mM Mg” " and antibiotic. One millilitre of the overnight culture was washed 
in the N-minimal medium without Mg** and adenine and resuspended in 1 ml of 
the same media. Diluted (1/50) suspended bacteria were inoculated in modified 
N-minimal media containing 10 1M Mg** and 0.3 mM inorganic phosphate and 
grown for 6.5h. Then, 120 Ci of phosphorus-32 were added and labelled for 
15 min. The labelled nucleoside 5’-triphosphates were extracted by formic acid 
extraction as described previously and analysed by PEI-cellulose thin-layer chro- 
matography plate*’. The levels of intracellular ATP were quantified by a Typhoon 
FLA-9000 phosphorimager (GE Healthcare). 

Examining gene expression inside macrophages. Macrophage infection was 
performed as described* with the following modifications: J774 A.1 macrophages 
were seeded in six-well plates in RPMI medium (Invitrogen) supplemented with 
10% heat-inactivated fetal bovine serum, 2 mM L-glutamine and 10 mM HEPES at 
a density of 10° per well one day before infection with Salmonella. Bacteria were 
grown overnight in Luria-Bertani broth media at 37 °C, washed with PBS and used 
to infect macrophages at a multiplicity of infection of 100:1. Plates were centrifu- 
gated at 1,000g for 5 min (defined as time 0) and incubated for 1 h for phagocytosis. 
Extracellular bacteria were killed with RPMI media containing 50 pg ml! 


gentamicin. The media were replaced after 1h with the same media containing 
10pgml~' gentamicin and incubation continued as indicated in the legend to 
Fig. 4a, b. At each time point, infected macrophages were lysed and stabilized with 
Tri reagent (Applied Biosystems) and RNA was extracted according to the 
manufacturer’s instruction. Control RNA from Salmonella cultured in tissue 
culture media was obtained as described previously*. Salmonella mRNA levels 
at each time point were normalized by the mRNA levels after growth in tissue 
culture media. To estimate the intracellular bacteria at each time point, cells were 
lysed with PBS containing 0.1% Triton X-100 and plated on Luria-Bertani broth 
plate with proper dilutions, or DNA was extracted from the same RNA sample 
according to the manufacturer’s instructions (Applied Biosystems) and measured 
by quantitative real time PCR. 

Mouse virulence assays. Six- to eight-week-old female C3H/HeN mice were 
inoculated intraperitoneally with ~10* colony-forming units. Mouse survival 
was followed for 21 days. Virulence assays were conducted three times with similar 
outcomes and data correspond to groups of five mice. All animals were housed in 
temperature- and humidity-controlled rooms, maintained on a 12h light/12h 
dark cycle. All procedures complied with regulations of the Institutional Animal 
Care and Use Committee of the Yale School of Medicine. 

Construction of plasmids harbouring lacZ fusions to mgtM. PCR fragments 
corresponding to nucleotides 1-59 of the mgtCBR leader were amplified with 
primer 9801, which includes the sequence corresponding to the plac,_s promoter, 
and either primer 9802 or 9803 (creating a stop codon) using 14028s genomic 
DNA asa template. The resulting PCR products were digested with Smal and Xbal 
and cloned into plasmid pACYC-'lacZ digested with the same enzymes. The 
sequences of the resulting constructs were verified by DNA sequencing. 
Construction of plasmids harbouring fusions to a promoterless gfp gene. 
pGFP303, a plasmid with the PhoP-dependent mgtCBR promoter and the wild- 
type mgtC leader fused to a promoterless gfp gene, was constructed as follows: a 
PCR fragment generated with primers 1746 and 8117 using 14028s genomic DNA 
as a template and digested with EcoRI and Xbal was cloned into plasmid pfpv25 
digested with the same enzymes. 

Derivatives of pGFP303 with nucleotide substitution in the mgtCBR leader 
region were constructed by cloning PCR fragments generated by two rounds of 
PCR reactions. For the Ay4-4s—>T substitution in the mgtC leader, a first PCR 
fragment was generated with primers 1746 and 11727, and a second fragment was 
generated with primers 11726 and 8117 using 14028s genomic DNA as a template. 
A third PCR was performed with primers 1746 and 8117 using the two PCR- 
generated DNA fragments as templates. The resulting PCR product was cloned 
into pfpv25 using the same restriction enzymes used for construction of pGFP303. 
All other substitutions were generated in a similar way using the following primer 
pairs: Ayy4¢>G (1746/11725 and 11724/8117), Agy4¢>C (1746/11399 and 
11398/8117), Ase-s7>G (1746/11723 and 11722/8117) and mgtMgcrambled 
(1746/11950 and 11949/8117). 

pfpv25mgtA, a plasmid with the PhoP-dependent mgtA promoter and wild- 
type mgtA leader fused to a promoterless gfp gene, was constructed as follows: a 
PCR fragment was generated with primers 6208 and 11737 using 14028s genomic 
DNA asa template and digested with EcoRI and Xbal and then cloned into pfpv25 
digested with the same enzymes. The sequences of the resulting constructs were 
verified by DNA sequencing. 

Construction of a plasmid harbouring the mgtM ORF. Plasmid pmgtM was 
constructed as follows: a PCR fragment corresponding to the mgtM gene was 
generated by PCR with primers 10074 and 10075 using 14028s genomic DNA 
as a template, digested with HindIII and BamHI and cloned into pUHE 21-2lacI* 
digested with the same enzymes. 

Construction of a strain with chromosomal deletion of the mgtC and mgtB 
genes. A Salmonella strain deleted for the mgtC and mgtB genes was generated by 
the one-step gene inactivation method’. A Cm® cassette was PCR amplified from 
plasmid pKD3 using primers 1908 and 1911 for the deletion of the mgtC and mgtB 
genes; the resulting PCR products were integrated into 14028s chromosome to 
generate strain EG16736 (mgtCB::Cm*). The mgtCB deletion (EL6) strain was 
made by removing the antibiotic resistance cassette from EG16736 using plasmid 
pCP20 as described”. 

Construction of strains with chromosomal mutations in the mgtCBR leader 
region. Two different methods were used to generate strains with chromosomal 
mutations in the mgtCBR leader. To create stop codon mutants in mgtM and with 
nucleotide substitution in the mgtCBR leader region, we used the fusaric acid 
method as described'*. We introduced a Tet” cassette in the mgtCBR leader region 
as follows: we generated a 1990-base-pair PCR product harbouring the tetRA 
genes using as template chromosomal DNA from strain MS7953s and primers 
7310/7370. The product was purified and used to electroporate 14028s or EG9527 
Salmonella containing plasmid pKD46. The resulting mgtCBR leader::tetRA 
strains containing plasmid pKD46 (EG18715 and EG18798) were kept at 30°C. 
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Then, we replaced the tetRA cassette in the mgtCBR leader region using DNA 
fragments carrying a mutation to create nucleotide substitutions. This was 
accomplished by preparing a DNA fragment harbouring nucleotide substitutions 
in the mgtCBR leader region by a two-step PCR process. 

To create strains with stop codon mutations in mgtM, DNA fragments carrying 
stop codons in mgtM were prepared as follows: we used two primer pairs 8118/ 
8769 and 8770/7308 (for UAG), 8118/8878 and 8880/7308 (for UAA1), 8118/8881 
and 8879/7308 (for UAA2), 8118/8986 and 8987/7308 (for UGA3), 8118/8882 and 
8883/7308 (for UAA4) or 8118/9851 and 9850/7308 (for UAG2) and 14028s 
genomic DNA as a template in the first PCR reaction. For the second PCR 
reaction, we mixed the two PCR products from the first PCR reaction as templates 
and amplified the DNA fragment with expected substitution using primers 8118 
and 7308. The resulting PCR products were purified and integrated into the 
EG18798 or EG18715 (for EG19307 and EG19322) chromosome and selected 
against Tet* with media containing fusaric acid to generate strains EG19269, 
EG19285, EG19289, EG19293, EG19298, EL90 (or EL91), EG19307 and 
EG19322, Tet® Amp* chromosomal mutants, respectively. 

DNA fragments carrying the A44—46 to T substitution in the mgtC leader were 
prepared by a two-step PCR reaction. For the first PCR reaction, we used two 
primer pairs 8118/11727 and 11726/7308, and 14028s genomic DNA as a 
template. For the second PCR reaction, we mixed the two PCR products from the 
first PCR reaction as templates and amplified the DNA fragment with expected 
substitution using primers 8118 and 7308. The resulting PCR products were purified 
and integrated into the EG18715 chromosome and selected against Tet® with media 
containing fusaric acid to generate EL341, a Tet* Amp® chromosomal mutant. The 
presence of the expected substitution was verified by sequencing. 

All other chromosomal mutants with substitutions in the mgtCBR leader were 
constructed by a multiple-step PCR process. Strain EL92 was constructed by 
inserting a Cm’ cassette in the yicL gene, which is 278 nucleotides upstream from 
mgtC transcription start site. The Cm" cassette was amplified from plasmid pKD3 
using primers 4801 and 4802 and the resulting PCR products were integrated into 
EG9527 chromosome to generate EL92 (yicL::Cm*). Then, we prepared DNA 
fragments containing a Cm* cassette and the proper nucleotide substitutions in 
the mgtCBR leader using two primer pairs and EL92 genomic DNA as a template: 
primer pairs 10077/8265 and 8264/7308 for the C87->G G95—C substitution, 
and primer 10077/8769 and 8770/7308 for UAG mutation in the mgtM. The two 
resulting DNA fragments from the first PCR reactions were mixed and used as 
PCR templates to amplify DNA fragments containing Cm™ cassette and the proper 
nucleotide substitution using 10077 and 7308. For EL98, DNA fragments were 
generated using primer pairs 10077/8265 and 8264/7308 and EL97 genomic DNA 
as a template. The resulting DNA fragments were purified and integrated into 
EG9527 chromosome by the one-step inactivation method** and mutants were 
selected for resistance to chloramphenicol. The presence of the expected substi- 
tution was verified by DNA sequencing. 

Quantitative RT-PCR. Total RNA was isolated using RNeasy Kit (Qiagen) 
according to the manufacturer’s instructions. The purified RNA was quantified 
using a Nanodrop machine (NanoDrop Technologies). Complementary DNA 
(cDNA) was synthesized using High Capacity RNA-to cDNA Master Mix 
(Applied Biosystems). The mRNA levels of the mgtC, mgtB, mgtA and rrs genes 
were measured by quantification of cDNA using Fast SYBR Green PCR Master 
Mix (Applied Biosystems) and appropriate primers (mgtC leader: 6962/6963; 
mgtC coding: 7530/7531; mgtB coding: 7763/7764; mgtA leader: 7225/7226; 
mgtA coding: 4308/4309) and monitored using a Fast ABI7500 machine 
(Applied Biosystems). Data were normalized to the levels of 16S ribosomal 
RNA amplified with primers 6970 and 6971. 

B-galactosidase assays. Cells were grown overnight in N-minimal media and 
washed twice in N-minimal media before resuspending them in N-minimal media 
with different MgCl, concentrations for 4h at 37 °C with shaking. The activity was 
determined as described*’. Data correspond to two independent experiments 
conducted in duplicate. 
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Determining intracellular pH in Salmonella. We measured intracellular pH 
using green fluorescent protein as described with modification**. Bacteria 
harbouring a plasmid containing the gfp gene expressed from heterologous pro- 
moter (pfpv25.1) were grown as shown in Figs 1 and 2b. Cells were normalized by 
Dgo0 nm and resuspended in 150 il of PBS in a 96-well black microplate. Excitation 
spectra were measured at 30°C from 480 to 510 nm (slit width, 2 nm), using an 
emission wavelength of 545 nm (slit width, 20 nm) by a Synergy H1 plate reader 
(BioTek). Spectra were measured for three biological replicates at each pH. A 
standard curve was determined for green fluorescent protein by measuring fluor- 
escence of samples resuspended in the same buffer at pH 5.5, 6.0, 6.5, 7.0 or 7.5 
with addition of 20 mM sodium benzoate, a permeant acid equilibrating cytoplas- 
mic pH with external pH. 

In-line probing. Experiments were performed as described” with the following 
modifications: the mgtC leader RNA was synthesized in vitro with a T7 RiboMAX 
Large Scale RNA production system (Promega) from the DNA template amplified 
from wild-type 14028s and primers (8675/11562) for the mgtC leader 1-195. DNA 
templates with Gos—>C, A44_4¢>T or As6_s7—G substitutions were prepared by a 
two-step PCR reaction. For the first PCR reaction, we used two primer pairs 
8675/8175 and 8174/11562 for Gg;—>C substitution, 8675/11727 and 11726/ 
11562 for Ay446—T substitution, 8675/11723 and 11722/11562 for As¢_s57>G 
substitution and 14028s genomic DNA as a template. For the second PCR 
reaction, we mixed the two PCR products from the first PCR reaction as templates 
and amplified the DNA fragment with expected substitution using primers 8675 
and 11562. To probe the structures at different Mg”* concentrations, 1 pmol of 
5'-end-labelled mgtC leader RNA was incubated in buffer (100 mM KCl, 50 mM 
Tris (pH 8.3)) with 1, 5 or 20mM Mg” * for 40h at room temperature. Reactions 
were quenched with urea gel loading buffer II (Ambion) and analysed on a 10% 
denaturing polyacrylamide gel. 

Examining the effect of pH on RNA structure. To determine pH effect on the 
structure of the mgtC leader, we used lead (II) acetate as it cleaves single-stranded 
RNAs non-specifically. One picomole of 5'-end-labelled wild-type 1-195 RNA or 
mutant RNA with A444¢—T substitution was incubated in 10 ul of either pH 8.3 
(100 mM KCl, 10mM Tris (pH 8.3), 10mM MgCl,) or pH 7.0 (100mM KCl, 
10mM Tris (pH 7.0), 10mM MgCl) buffer for 10 min at 37°C to allow RNA 
folding. Then, 1 pl of fresh solution of lead (II) acetate (50 mM) was added and 
incubated for 2 min at 37 °C. Reaction was stopped by adding 10 il of gel loading 
buffer II (Ambion) on ice and analysed on a 10% denaturing polyacrylamide gel. 
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Atomic model of the type III secretion system needle 
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Christian Griesinger', Michael Kolbe*, David Baker”, Stefan Becker’ & Adam Lange! 


Pathogenic bacteria using a type III secretion system (T3SS)'” to 
manipulate host cells cause many different infections including 
Shigella dysentery, typhoid fever, enterohaemorrhagic colitis and 
bubonic plague. An essential part of the T3SS is a hollow needle- 
like protein filament through which effector proteins are injected 
into eukaryotic host cells**®. Currently, the three-dimensional 
structure of the needle is unknown because it is not amenable to 
X-ray crystallography and solution NMR, as a result of its inherent 
non-crystallinity and insolubility. Cryo-electron microscopy com- 
bined with crystal or solution NMR subunit structures has recently 
provided a powerful hybrid approach for studying supramolecular 
assemblies””, resulting in low-resolution and medium-resolution 
models'*”. However, such approaches cannot deliver atomic 
details, especially of the crucial subunit-subunit interfaces, 
because of the limited cryo-electron microscopic resolution 
obtained in these studies. Here we report an alternative approach 
combining recombinant wild-type needle production, solid-state 
NMR, electron microscopy and Rosetta modelling to reveal the 
supramolecular interfaces and ultimately the complete atomic 
structure of the Salmonella typhimurium T3SS needle. We show 
that the 80-residue subunits form a right-handed helical assembly 
with roughly 11 subunits per two turns, similar to that of the 
flagellar filament of S. typhimurium. In contrast to established 


models of the needle in which the amino terminus of the protein 
subunit was assumed to be a-helical and positioned inside the 
needle, our model reveals an extended amino-terminal domain 
that is positioned on the surface of the needle, while the highly 
conserved carboxy terminus points towards the lumen. 

Wild-type S. typhimurium T3SS needles were obtained by in vitro 
polymerization of recombinantly produced full-length PrgI protomers. 
The diameter of the resulting needle-like filaments (Supplemen- 
tary Fig. 1b) agrees with that of T3SS needles from natural sources’. 
The addition of wild-type PrgI protomers also elongated needles of 
isolated S. typhimurium T3SSs up to several micrometres in length 
(Supplementary Fig. 2), showing the functionality of the recombinant 
protein (as previously reported for Prgl* (ref.18)). Recently we 
demonstrated that excellent-quality solid-state NMR (ssNMR) spectra 
of Prgl needles can be obtained by ‘°C spin dilution’, which allowed us 
to achieve the complete ssNMR resonance assignment. As determined 
from conformation-dependent chemical shifts, PrgI adopts a rigid 
conformation (except for Met 1 and Ala2) comprising four distinct 
structural elements (Fig. la): an N-terminal extended domain 
(Thr 3-Tyr 8), an o-helix («1) (Leu 9-Ala 35), a loop (Ala 36-Pro 41) 
and a C-terminal o-helix (#2) (Ala42-Arg 80). Very narrow eC 
linewidths (~0.15-0.3 p.p.m.) and the presence of a unique set of 
NMR signals exclude the possibility of conformational heterogeneity 
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Figure 1 | Structural data from ssNMR. a, Secondary structure analysis. 
Positive and negative secondary chemical shifts are indicative of «-helical and 
B-sheet structure, respectively. b-f, Collection of ssNMR distance restraints. 
b, ’c-¥c PDSD spectra of [1-13C]Glc-labelled T3SS needles. c, °C-13C 
spectra of [2-!3C]Glc-labelled (in magenta) and [(1/2)-!?C]Glc-labelled (in 
blue) T3SS needles. Intersubunit restraints can be obtained from the [(1/ 


2)-3C] Glc-labelled spectrum, for example, Gly 19Ca-Ala 35CB and 

Gly 19Ca-Ala 36Cf. d, a Ces © spectra of [1-'3C]Glc-labelled (in green), 
[2-1?C]Glc-labelled (in magenta) needles. e, PCN spectra of [(1/2)-"°C]Gle- 
labelled (in blue) T3SS needles. Intersubunit restraints are encoded in the 
[(1/2)-!3C]Glc-labelled spectrum: Pro 41C8-Tyr 8CB and Ser 39Ca-Tyr 8CB. 
f, °N-'3C PAIN-CP spectrum of [2-'3C]Glc-labelled T3SS needles. 
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in the wild-type needle state, whereas in X-ray structures of soluble, 
C-terminally truncated protein” or the double-mutant Prgl* (V65A/ 
V67A)"* the first ~18 N-terminal residues are disordered (Sup- 
plementary Fig. 1c). In particular, ssNMR reveals a highly ordered 
N-terminal segment (Thr 3-Tyr 8) with unusual chemical shifts that 
do not correspond to regular o-helical or B-sheet conformation. A kink 
(Val 20-Asn 22) interrupts the helix «1 that is not observed in the 
monomeric X-ray crystal structures'®. In PrgI* needles, B-strand con- 
formation was observed at the C terminus’* (Supplementary Fig. 1c), in 
contrast with the wild-type needle preparation. However, the observed 
structural differences could represent different functional states of the 
filament, as suggested previously®. 

The sparse [1-'3C]Glc or [2-1°C]Gle labelling significantly improves 
the quality of '*C-'°C ssNMR spectra'’, and numerous cross-peaks 
can readily be assigned to long-range distance restraints (see Sup- 
plementary Figs 3-7 and Methods). The [1-'*C]Glc-labelled”’ sample 
(Fig. 1b) provides many methyl-based long-range restraints. Similarly, 
the [2-'°C]Glc-labelled?! sample (Fig. 1c) shows excellent spectral reso- 
lution in the aromatic region, permitting the collection of unambiguous 
restraints. In total, 521 restraints were assigned (including 247 long- 
range ones; see Supplementary Table 1 and Methods), including long- 
range SN-1C restraints (detected in a PAIN-CP (proton-assisted 
insensitive nuclei cross-polarization) spectrum”; Fig. 1f). Roughly 
twice as many unambiguous long-range restraints were found per 
residue than in the current benchmark in ssNMR structural studies, 
the HET-s prion domain”. 

To distinguish intrasubunit from intersubunit distance restraints, 
we compared spectra of [1-'°C]Glc-labelled, [2-!°C]Glc-labelled and 
mixed [(1/2)-'°C]Glc-labelled PrgI needle samples™ (Fig. 1d, e) and 
identified 162 intersubunit and 359 intrasubunit interactions that 
together define the overall organization of the T3SS needle assembly 
(Fig. 2). 

Intrasubunit restraints (Fig. 2, red lines) reveal a helical hairpin 
motif in Prgl. The kink segment (Val 20-Asn 22) disrupts the «1-02 
interactions, with no intrasubunit long-range restraints detected for 
residues Asp 17—Asn 22 and their counterpart Arg58-Asn 59. The 
N-terminal part of «1 is again interacting with the C-terminal part 
of «2. Multiple restraints (Fig. 2, bold lines) were detected between 
Leu9 and Lys 69, the most N-terminal and C-terminal residues that 
take part in the helical hairpin motif. No intrasubunit long-range 
restraints were detected for the N-terminal (Thr3-Tyr8) and 
C-terminal (Asp 70-Arg 80) segments, but the high intensity of their 
NMR signals’ clearly demonstrates their rigidity and well-defined 
structure, indicating that these residues are stabilized through inter- 
subunit contacts as described below. 

The intersubunit distance restraints (Fig. 2, blue lines) reveal a 
lateral interface, mostly formed by helix—helix packing, and an axial 
interface. The lateral interface connects neighbouring PrglI subunits by 
means of %1-«1 and %2-«2 helix—helix packing in an arrangement in 
which the subunits are staggered along the filament axis with an axial 
translation of ~24 A (indicated for example by restraints involving 
Ala60 of one molecule and Ile 76 of an adjacent one). The last five 
C-terminal residues that are known to be essential for the needle 
formation and infectivity, in particular Ile 76 and Phe 79, are involved 
in multiple intersubunit restraints. The central part of «1 helices 
(Asp 17-Leu 23, including the kink) interacts with adjacent subunits 
through the extended N terminus, in particular the Trp 5 aromatic 
ring, and also through the end of helix «1 (Ala35-Lys37) of the 
adjacent subunit. The ssNMR restraints also reveal strong intersubunit 
interactions between the extended N-terminal domain (Thr 3-Tyr 8) 
and the loop region (Ala36-Pro41), defining the axial packing 
between subunits on top of each other (Fig. 2). 

According to previous scanning transmission electron microscopy 
(STEM) measurements”’, the axial rise per subunit in PrgI needles is 
~4,2 A. The intermolecular distance restraints show that laterally 
adjacent subunits are axially translated from one turn to the next by 
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Figure 2 | Architecture of the T3SS needle assembly as determined by 
ssNMR. The dashed lines represent the intrasubunit (in red) and intersubunit 
(in blue) ssNMR distance restraints. Bold lines indicate the detection of more 
than three distance restraints. Lateral interactions between subunits i andi + 5 or 
i+ 6are not distinguishable at this stage of the analysis, but only after first rounds 
of Rosetta calculations (see Methods). N-ter, N terminus; C-ter, C terminus. 


~24 A, which divided by ~4.2 A yields ~5.7 subunits per turn. These 
values are consistent with a helical arrangement in which roughly 11 
subunits are arranged in two turns. To generate three-dimensional 
models of the filament, rather than docking preformed monomeric 
structures which would not capture possible conformational changes 
on assembly, we extended the Rosetta symmetric fold-and-dock pro- 
tocol’® to helical symmetry. Starting from a helically arrayed set of 
extended polypeptide chains, we optimized the total energy of the 
system including the ssNMR restraints and conformation-dependent 
chemical shifts by simultaneously sampling both the internal degrees 
of freedom of the monomers and the five rigid body degrees of freedom 
(the radius of the needle was restrained using the previously published 
cryo-electron microscopy map” ). Models were generated for 9-start, 
1l-start, 13-start and 15-start helices (see Methods); the 11- start 
models had helical parameters more consistent with the STEM data” 
and had more favourable ssNMR restraint energies (Supplementary 
Table 3). 

In an 11-start helical arrangement, interactions between subunits i 
and i+ 11 constitute the axial interface, and subunit i is laterally 
surrounded by subunits i+ 5 and i+ 6. We used first-round models 
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generated with only the unambiguous axial and intramolecular 
restraints to resolve the (5/6) ambiguities in the lateral restraints in 
two successive rounds of structure calculations (Supplementary Fig. 8). 
Whereas the inner «2 helices are parallel in the final models, the «1 
helices present a cross-fenced pattern that emerges from the inclusion 
of two clusters of restraints, one involving Trp 5-Val 20 and the other 
Val 20-Ala 36 lateral interactions (Fig. 3; alternative assignments of 
these restraints resulted in significantly higher energy structures). The 
cross structure is further corroborated by the chemical shift data, 
which indicate a kink at Val 20 (Fig. 1a) as well as the lack of long- 
range intramolecular restraints for the same residue (Fig. 2). The com- 
bination of the ssNMR restraints with all-atom Rosetta refinement 
may yield near-atomic resolution; for example, the turn connecting 
the two helices has a backbone root mean squared deviation (r.m.s.d.) 
to the crystal structure of the monomer of less than 0.5A (Sup- 
plementary Fig. 8g). 

The resulting T3SS needle model (Fig. 3 and Supplementary Fig. 9) 
consists of a ~80-A outer diameter tube with a ~25-A axial lumen 


Figure 3 | Complete atomic model of the T3SS needle. a, b, Ribbon 
representation showing different subunits: side perspective (a); top view 

(b). The N-terminal domains (N-ter) are highlighted by red dashed circles. 
c-e, Selections of the subunit-subunit interfaces. c, The axial interface between 
subunits i and i — 11. d, The lateral interface between subunits i and i+ 6. 

e, The lateral interface between subunits i and i + 5. Blue dashed lines represent 
ssNMR restraints. 
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(Fig. 3b). The N-terminal domain was proposed to be o-helical in 
previous studies of the homologous Shigella flexneri needle’’”’ and 
to form the inner shell of the filament. Our atomic model of the 
T3SS needle clearly diverges from this prediction: the N-terminal 
domain is not o-helical and is located on the exterior surface of the 
needle, not on the inner side. 

To corroborate this finding, we performed immunoelectron micro- 
scopy on both S. typhimurium cells and needles polymerized in vitro 
(Supplementary Figs 10 and 11). S. typhimurium prgI-knockout cells 
complemented with prgI fused to an upstream Strep-tagII were 
labelled with monoclonal anti-Strep antibodies (Supplementary Figs 
11 and 13). The Strep-PrglI is fully functional in HeLa cell invasion 
assays of complemented S. typhimurium prgl-knockout cells (Sup- 
plementary Fig. 14). Similarly, in vitro needles of an N-terminal His- 
tag fusion with wild-type PrgI bind to an anti-His-tag monoclonal 
antibody that decorates the exterior surface of such needles (Sup- 
plementary Figs 10 and 12b). These results confirm that the Prgl 
N terminus is surface-exposed, an arrangement similar to that of the 
flagellar filament", which shares a genetic origin with the T3SS needle. 

The rigid extended conformation of the N-terminal residues 
reported here clearly differs from previous models. The two aromatic 
residues Trp5 and Tyr8 form the basis of the axial N-terminal- 
domain-loop interface (Fig. 3c). Indeed, in vivo invasion assays show 
that the single point mutant Y8A leads to decreased invasiveness of 
S. typhimurium (Supplementary Fig. 14), and the double mutation 
WS5A/Y8A completely abrogates host cell invasion (Supplementary 
Fig. 14). Trp 5 also seems to have a crucial function in the lateral i to 
i+6 interface (Fig. 3d; and W5A/V20A double mutant in 
Supplementary Fig. 14). Furthermore, the atomic model reveals that 
the lateral i to i+ 5,6 interface strongly involves «2-2 helix—helix 
contacts (Fig. 3e) and that the last C-terminal residues Ile 75, Ile 76 
and Phe 79 are essential. This is confirmed by the non-invasiveness of 
the I76A/F79A PrgI double mutant (Supplementary Fig. 14). In addi- 
tion, we compared S. typhimurium wild-type and prgI-knockout cells 
either complemented with wild-type or mutated prgl by using effector 
secretion assays and transmission electron microscopy (TEM) of a 
negatively stained specimen. The results obtained (Supplementary 
Fig. 16) agree well with the invasion assays shown in Supplementary 
Fig. 14. SDS-PAGE and western blot analysis showed that the double 
mutations (W5A/Y8A and I76A/F79A), but not the single mutations 
(W5A and Y8A), abolish effector secretion (Supplementary Fig. 16a, b). 
However, the W5A/V20A double mutant can secrete effector proteins, 
which is also in agreement with the results previously obtained by 
bacterial invasion assays. TEM analysis of these strains confirmed 
needles only in wild-type and prgl-knockout cells complemented with 
wild-type prgI (Supplementary Fig. 16c, d) but not in the mutant 
strains, which might be an indication of decreased needle stability. 

Residues conserved between S. typhimurium, Shigella flexneri, 
Yersinia pestis and Escherichia coli T3SS needle proteins are mostly 
facing towards the lumen (Supplementary Figs 15 and 17). Weakly or 
non-conserved residues are exposed to the needle surface and are thus 
accessible from outside the pathogen, which could reflect a bacterial 
strategy to evade a host-cell immune response. The inner surface of the 
channel consists mainly of polar residues (see Supplementary Fig. 18 
for the electrostatic surface potential of the T3SS channel). In a similar 
manner to the situation in the flagellar filament, the most C-terminal 
residue Arg 80 (in flagellin this is Arg 494 (ref. 14)) places a positive 
charge into the channel. 

Our approach opens an avenue for the investigation of the subunit 
interfaces of other homo-oligomeric assemblies at atomic resolution in 
their native-like state** °°. For larger or more complex systems, such as 
the flagellar filament", our combination of atomic-level ssNMR struc- 
tural restraints and cryo-electron microscopy data could bridge the 
gap between X-ray or NMR structures of monodisperse subunits and 
cryo-electron microscopy data of the filamentous state. 
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METHODS SUMMARY 

Sample preparation. '°N-labelled and '°C-labelled Prgl protomers were 
expressed in E. coli. Polymerization of purified PrgI was allowed to take place at 
room temperature (22 °C) for 1 week. 

Solid-state NMR. Experiments were conducted on 14.1-T and 20.0-T wide-bore 
Bruker spectrometers at magic angle spinning (MAS) frequencies of ~11 kHz. 
C-C proton-driven spin-diffusion (PDSD) spectra were recorded with mixing 
times as indicated in the main text. 

Structure calculation. T3SS needle structures were calculated with a modified 
Rosetta fold-and-dock protocol, using a 29-subunit system. The calculations con- 
verge within 2.1 A backbone r.m.s.d. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Expression, purification and polymerization of '*N-labelled and ‘C-labelled 
wild-type PrgI protein. The synthetic full-length PrgI encoding DNA was pur- 
chased from GeneArt and cloned into a modified pET16b vector (Novagen). The 
resulting construct codes for a fusion protein with an N-terminal His, tag followed 
by a tobacco etch virus (TEV) protease cleavage recognition sequence that con- 
nects it to the PrgI protein sequence. Thus, after protease cleavage the released Prgl 
protein contains the non-native N-terminal residues glycine and histidine. The 
construct was transformed into E. coli strain BL21(DE3). Expression of labelled 
Pregl protein was performed in minimal medium with '"NH,Clas nitrogen source 
and p-[U-*Cg] glucose, p-[1-'°C] glucose or p-[2-'*C] glucose as carbon source. 
An overnight minimal medium culture was inoculated 1:25 (v/v) into 11 of 
minimal medium. The culture was shaken at 37°C until a Deoonm of 0.7. 
Protein expression was induced by adding 0.7 mM isopropyl B-b-thiogalactoside; 
5h after induction the cells were harvested and stored at —80°C. 

The cell pellet of a 1-litre expression culture was dissolved in 50 ml of lysis 
buffer (100 mM sodium phosphate, 10 mM Tris-HCl, pH 8.0, 8 M urea, 0.5 mM 
phenylmethylsulphony! fluoride (PMSF)) and stirred for 1 h at room temperature. 
Subsequently, the suspension was centrifuged for 30 min at 33,000g and the super- 
natant was incubated on a rocking shaker for 1 h with 3 ml of Ni’ -nitrilotriacetate 
(Qiagen) resin that had been pre-equilibrated with the lysis buffer. The resin 
suspension was filled into a 10-ml disposable plastic column (Pierce). The column 
was washed with 60 ml of lysis buffer and the protein was eluted with 8 X 2 ml of 
elution buffer 1 (100 mM sodium phosphate, 10 mM Tris-HCl, pH 5.9, 8 M urea, 
0.5mM PMSF) and 8 X2 ml of elution buffer2 (100 mM sodium phosphate, 
10mM Tris-HCl, pH 4.5, 8M urea, 0.5mM PMSF). The elution fractions were 
analysed on a 17.5% SDS-PAGE gel. Fractions containing the fusion protein were 
combined and concentrated with a Vivascience 5-kDa molecular mass cut-off 
concentrator (Sartorius Stedim Biotech) to a final volume of 4 ml. Subsequently, 
2-ml aliquots of the sample were loaded onto a 8mm X 250 mm semipreparative 
HPLC column (RP18 Eurospher 100; Knauer) that had been pre-equilibrated with 
0.1% aqueous trifluoroacetic acid (TFA). The column was washed with ten column 
volumes of 0.1% TFA, and the protein was eluted at a flow rate of 3 ml min ! with 
a 150-ml linear gradient of 0-100% acetonitrile/0.1% TFA. The elution peaks were 
analysed by electrospray ionization mass spectrometry (ESI-MS). Elution frac- 
tions containing the fusion protein without any further detectable impurities were 
combined and freeze-dried. 

The freeze-dried protein was dissolved in ice-cold water by vortex-mixing and 
then incubated for 1h on ice. Subsequently, using cold stock solutions, the fol- 
lowing buffer was adjusted in this aqueous protein solution: 50mM Tris-HCl, 
pH8.0, 0.5mM EDTA, 1mM dithiothreitol, 0.5mM PMSF. To this buffered 
protein solution TEV protease was added at a final ratio of 3 mg of TEV protease 
per 100 mg of fusion protein. The digestion was performed for 20h at 4°C. The 
released PrgI protein was further purified by reverse-phase HPLC as described 
above for the fusion protein. Elution fractions containing pure Prgl protein 
according to ESI-MS analysis were combined and freeze-dried. 

The freeze-dried protein was dissolved at 0.085mM_ concentration 
(é2g9 = 10,810 M_'cm!) in ice-cold water by vortex-mixing and dialysed at 
4°C against 20mM MES, pH5.5, 0.02% sodium azide. Finally, the sample was 
concentrated with a Vivascience 5-kDa molecular mass cut-off concentrator to a 
final concentration of 0.1 mM. 

The polymerization of the PrgI protein took place by incubation at room tem- 
perature for 1 week without shaking the sample. Thereafter the needles were 
centrifuged for 30 min at 52,000g. The protein pellet was washed extensively with 
20mM MES pH5.5 before transfer to the ssNMR rotor. 

Invasion assays. HeLa cell invasion assays were performed as described in ref. 18. 
In brief, cells were infected with S. typhimurium SL1344 and PrgI mutants at 
multiplicity of infection of 10:1 after inducing their expression with 0.2 1g ml’ 
anhydrotetracycline for 1h. After 20 min of infection, the cells were washed with 
PBS, and fresh medium supplemented with 100 1g ml * gentamicin was added. 
Cells were incubated for a further 2 h to allow intracellular survival and replication 
of bacteria. They were then lysed with 0.1% Triton X-100 and the lysates were 
plated on tryptic soy agar plates to count the number of viable bacteria after 16h. 
Secretion assays. Single colonies of S. typhimurium SL1344 and PrgI mutants were 
inoculated in Luria-Bertani medium with 0.04 1g ml’ anhydrotetracycline and 
incubated with shaking at 37 °C for 16h. The cultures were centrifuged at 4,354¢ 
for 10 min to remove the bacteria. The supernatants were filtered through 0.22-j1m 
filters and precipitated with trichloroacetic acid. Pellets were washed once with 
acetone, dried, resuspended in SDS loading buffer and analysed by SDS-PAGE. 
Preparation of cells for electron microscopy analysis. Cultures of S. typhimurium 
were grown as described above, and 1 ml aliquots were fixed with 2% para- 
formaldehyde for 1 h. The fixed cells were washed once with PBS and resuspended 
in 200 ul of PBS for electron microscopy analysis. 


ssNMR spectroscopy and detection of distance restraints. ssNMR experiments 
were conducted on 14.1-T and 20.0-T ('H resonance frequencies of 600 and 
850 MHz, respectively) wide-bore spectrometers (Bruker Biospin) equipped with 
4mm triple-resonance (H, '*C and '°N) MAS probes. Samples were packed in 
4-mm MAS rotors, using protein quantities of ~10 mg. Spectra were recorded at 
MAS spinning rates of 10.75 or 11 kHz and were calibrated with 4,4-dimethyl-4- 
silapentane-1-sulphonic acid (DSS) as an internal reference. The temperature- 
dependent position of the water proton resonance was used to measure the 
temperature inside the MAS rotor*'. All experiments were conducted at a sample 
temperature of 278 K. High-power 'H-'°C decoupling (SPINAL-64)* with a 
radiofrequency amplitude of 83 kHz was applied during evolution and detection 
periods. An initial ramped cross-polarization was used to transfer magnetization 
from 'H to °C with a contact time of 1.5 ms. '*C-'°C polarization transfer was 
achieved by means of PDSD* with mixing times as indicated in the text and in the 
supplementary figures. 

A N-1C PAIN-CP spectrum was recorded as described in ref. 19. Total 
experimental times were 4-9 days depending on the number of scans (maximum 
acquisition times t, of 12ms for [1-'°C]Glc-labelled and [2-'°C]Glc-labelled 
samples, and 7 ms for the mixed-labelled sample were used). Distance restraints 
with a unique assignment (spectrally unambiguous) were collected in a first round 
of analysis (corresponding to ~70% of the total number of restraints) and used to 
clarify the assignment of additional ambiguous restraints (with four or fewer 
assignment possibilities). Intermolecular distance restraints were first assigned 
in the [(1/2)-'°C]Glc-labelled sample by direct comparison with [1-!?C]Glc- 
labelled and [2-'*C]Glc-labelled samples, as described in ref. 24. On the basis of 
this first set of restraints, additional intermolecular distances were later identified 
in spectra of the [1-°C]Glc-labelled and [2-'°C]Glc-labelled samples. 

NMR spectra were analysed with CcpNmr**. 

Modelling. A modification of the fold-and-dock protocol’’, now part of the 
standard Rosetta 3.3 distribution, was used for all model calculations. All symmetric 
subunits within the needle helix are treated explicitly in a 29-subunit system. The 
protocol consists of backbone fragment insertion trials that are replicated among 
symmetric subunits and symmetrical rigid body trials of adjacent subunits using a 
coarse-grain representation of the system, followed by side-chain packing and all- 
atom refinement of the side chain, backbone and rigid body degrees of freedom, 
using a physically relevant energy function”. 

The chemical shift assignments of the backbone ‘°N and °C atoms from 
ssNMR experiments were used to bias the selection of overlapping three-residue 
and nine-residue fragments of backbone conformations. Using these fragments 
and the intrasubunit and intersubunit restraints, we performed a series of model 
calculations, starting from a helix of extended chains. A pseudo-energy term was 
used to restrain the calculations according to the agreement with the experimental 
distances, while optimizing the Rosetta energy function. ssNMR cross-peaks were 
converted to internuclear C-C and C-N distance restraints with an upper limit of 
5Aanda penalty that grew exponentially with distance. 

Intersubunit distance restraints were first classified into two categories: the 
‘axial’ restraints corresponding to i to i + 11 (equivalently i to i + n for an n-start 
arrangement in the more general case) intersubunit interactions and the ‘lateral’ 
restraints corresponding to i to i + 5/6 intersubunit interactions. The resolution of 
(5/6) ambiguities was made through the manual identification of self-consistent 
clusters of contacts between subunits i and i + 5 or between subunits i and i+ 6. 

In detail, this was done in three stages. In stage I, using the axial and intrasubunit 
restraints alone, we identify two clusters of helical arrangements (Supplementary 
Fig. 8b). The first cluster corresponds to a left-handed helix of 5.6 subunits per turn 
that showed lateral interactions between subunits i to i+ 5 andi toi+ 6, and the 
second cluster is a right-handed helix with 11 subunits per turn that shows lateral 
interactions between subunits i to i— 1 and i to i+ 1. This model is in disagree- 
ment with the estimates of subunits per turn of ~5.6 from STEM and ssNMR data 
(see the text). We therefore focus on the first cluster, using which we can easily 
assign two sets of lateral ssNMR restraints that report on theitoi +5 anditoi+6 
interfaces between the inner (C-terminal) helices (Supplementary Fig. 8a). Using 
the axial, intrasubunit and the stageI assigned lateral restraints we perform a 
second set of calculations, one for each helical handedness as the same lateral 
interactions can be satisfied by a right-handed helix and by a left-handed helix 
(stage IIR and stage IIL). This is done by keeping the axial and intersubunit inter- 
actions fixed but interchanging the restraints assigned for the i to i+ 5 and i to 
i+ 6 interfaces (Supplementary Fig. 8d). Using this second round of models we 
can further assign two clusters of restraints that correspond to lateral interactions 
connecting the outer helices (Supplementary Fig. 8c). This corresponds to two 
clusters of interactions involving W5 from subunit i + 11 and A36 from subunit i 
with the V20 kink region of subunit i+ 6 in the left-handed model, shown with 
red and yellow dashes, respectively, in Supplementary Fig. 8c. A final stage of 
calculations is then performed using all assigned lateral restraints for each helix 
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(right-handed and left-handed) and the resulting ensembles are reported as the 
final models. The calculations converge within 2.1 A backbone r.m.s.d. for the 
right-handed and 2.3 A r.m.s.d. for the left-handed model (Supplementary Fig. 
8e, f). Both models satisfy the experimental restraints with a minimal number of 
violations, although the right-handed model shows more favourable restraint and 
interface energies. Furthermore the two ensembles show very similar helical para- 
meters (number of subunits per turn, radius and axial rise) and also full-atom 
interface energies (Supplementary Table 2). We note that the central part of the 
needle protein (Leu 34—Leu 43) is highly conserved between our atomic model and 
the crystal structure of Prgl* (ref. 18) (Supplementary Fig. 8g). 

The radius of the needle was restrained during each calculation by using a 
pseudo-energy function that measures the correlation of the whole model to the 
previously published cryo-electron microscopy map”. In all calculations, no 
assumptions were made about the helical parameters of the needle (that is, the 
degree of angle rotation and unit translation along the helical axis). Instead, these 
parameters emerge in the final structural ensemble according to the interpretation 
of the intermolecular interactions. Calculations are repeated for different inter- 
pretations of the intersubunit distance restraints according to 9-start, 13-start and 
15-start helical arrangements in addition to the 11-start presented here. These 
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complementary calculations result in converged structural ensembles with similar 
axial and lateral interfaces but involving interactions between different symmetry- 
related monomers and helical parameters (Supplementary Table 3). Although no 
unambiguous distinction can be made between these geometries in terms of 
agreement with the ssNMR distance restraints alone, the symmetry considerations 
presented in the main text suggest that the 11-start arrangement agrees better with 
the complementary STEM measurements. This was the basis for selecting the 11- 
start needle as the final model. 
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CORRECTIONS & AMENDMENTS 


RETRACTION 
doi:10.1038/nature11164 


Retraction: DNA demethylation in 
hormone-induced transcriptional 


derepression 


Mi-Sun Kim, Takeshi Kondo, Ichiro Takada, Min- Young Youn, 
Yoko Yamamoto, Sayuri Takahashi, Takahiro Matsumoto, 
Sally Fujiyama, Yuko Shirode, [Ikuko Yamaoka, 

Hirochika Kitagawa, Ken-Ichi Takeyama, Hiroshi Shibuya, 
Fumiaki Ohtake & Shigeaki Kato 


Nature 461, 1007-1012 (2009); doi:10.1038/nature08456. 


In this Letter, we claimed that hormone-regulated transcriptional 
control involves DNA demethylation mediated by MBD4. Sub- 
sequently, we corrected some figure panels that appeared to have been 
erroneously prepared (Corrigendum, http://www.nature.com/nature/ 
journal/v480/n7375/full/nature10604.html). However, we later found 
that other figure panels contained data duplications. After further 
review, we now conclude that the results presented in the original 
figures had been inappropriately manipulated, and given these more 
serious concerns we no longer have confidence in the original figures. 
We therefore wish to retract this Letter and sincerely apologise for 
any adverse consequences that may have resulted from the paper’s 
publication. 
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Uta SCIENCE FICTION 


BY ROBERT NATHAN CORRELL 


cc ey George!” 
A blast of cool, processed air 
entered the room as my new 


boss strolled in. 

I stopped typing. “The name's Ben” 

“Not today.’ He threw another requisi- 
tion on top of the towering stack 
of papers already on the desk. 
“New request just came in over the 
squealer. It’s a priority — something 
about farm animals.” 

I glanced over the sheet. “Sure. Just 
give me whatever sample you've got of 
this guy’s writing and I'll get on it as soon 
as I finish this.” 

“We dont have anything. Just a list of titles 
and a name. This guy’s stuff must have got 
lost at the beginning of the Big Crash. There 
are no samples. Go wild. And do it now. 
That; he indicated my current project, “can 
wait. Two days.” 

“Two? I usually have at least four to pound 
one of these out.” 

“Not this time. I told you — it’s urgent.” 

“Looks like he did a book on livestock and 
what else, some kind of single-year history? 
What's the rush?” 

The boss sighed. “Some idiot let an old 
beta version of the Omnipedia out on the 
web and there are links to this guy all over 
it. Download requests are hitting us from 
e-readers all over the country. You're it, 
George.” 


“Hey, Ben — the guys and I are going out for 
a drink to catch the daily newsview. Want 
to come?” 

I stopped typing and sighed. “Not today, 
Muriel. Priority request. Boss says he needs 
it yesterday.” 

“Seriously? Look at you — sweating it out 
in this little closet on that ancient P. O. S. all 
damn day... why do you put up with it?” 

“Boss says it’s verisimilitude. You guys all 
get to use those telepathic scribes, but that 
doesn't sound the same as what you get out 
of a typewriter. Something about cadence.” 
I shook my head. “Besides, those things 
record everything that goes through your 
head. Doesn't that bother you?” 

She laughed. “Thinking seditious 
thoughts? Or just try- 


ing to make out your NATURE.COM 
grocery list for the  FollowFutureson 
week? Whatever — this Facebook at: 

isa dead-end job. You _ go.nature.com/mtoodm 
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SQUEALER 


Mouthpiece for a generation. 


should ask for a promotion to tech writing 
or something. You could get an office next to 
mine upstairs. At least we don’t have to dig 
through these piles of paper.” She prodded 
the swaying stacks of copy with disgust. 

I shook my head. “You all go on. I'll catch 
up with you later. Maybe Wednesday.’ 

“You really love this crap, don't you? The 
deadlines, the typewriter, the stories... If 
you weren't just churning out those prop 
pieces about duty to state and the virtues of 
hard work, Id think you were the only real 
writer left in the country.” 

“Don't say that, Muriel.” I looked out 
the door with an exaggerated motion and 
smiled. “Someone could hear” 


That night I walked back to the apartment, 
past shuttered stores and boarded-up old 
tenements. It was still long before curfew 
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and some people were about. A few were 
entering a dimly lit dive to catch the three 
hours of newsview that were mandated each 
night, while others tapped and scratched at 
their electronic tablets. I wondered if they 
were reading anything of mine. Maybe some 
Burroughs or Asimov — those had been fun 
to do. rubbed my head. This afternoon had 
not gone so well and I wished I had gone 
with Muriel and the others after all. The 
words just wouldn't come. Not writer’s 
block, but something about my assignment’s 
title that tickled the back of my mind and 
wouldnt let go. 
After an hour or so of digging through 
the hidden compartment under my 
bed, I found the answer. It was in 
the stack of old volumes I picked 
out of my grandmother's storage 
unit the year before. A decaying 
paperback with a pig and a horse on 
the cover, barely visible beneath mould 
and water stains. I held it, running a fin- 
ger across the pages and watching them flake 
away. What a trick — a bit of sleight 
of hand, electronic legerdemain. The 
boss once told me: “Just too much was 
lost in the Crash. So much of our history. 
And the idea that it’s just gone would be too 
much for most people to believe.” I looked 
at the book in my hand and laughed. I was 
replacing entire texts with real bits of fiction, 
and there was no one to know the difference 
but a dying generation and their failing 
memories. And me. I looked around my nice 
place. A real bed. Kitchenette. My own bath- 
room. Even Muriel didn't have one of those. 
On the newsview, a government drone 
dropped bombs on some Nomad shacks, 
while the occupants were marched away at 
rifle-point. Re-settlement, re-education. But 
I never saw a man who could write a good 
propaganda piece in a work camp. And I 
meant to be writing for a long, long time. 
Turning, I pushed the book down the incin- 
erator chute and began heading back to the 
office, off to write a young adult novel about 
the exciting adventures of a patriotic farmer 
and his loyal farm creatures, all toiling for 
the greater good. This, for instant delivery 
to the tablet computers of would-be revolu- 
tionaries and faux counterculture icons all 
over the nation, each of them looking for the 
ideas that could change their world. = 


Robert Nathan Correll is a postdoctoral 
fellow in cardiovascular biology who lives in 
Kentucky. He does not own an e-reader. 
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PHARMACEUTICAL SECTOR 


Delicate transition 


With lay-offs rife in the drug industry, life scientists and 
chemists are seeking fresh career paths. 


BY CHARLOTTE SCHUBERT 


ocals were irate when the drug giant 
Le closed its 70-hectare research 
and development (R&D) facility in Ann 
Arbor, Michigan, in 2007. T-shirts sporting the 


word ‘Pfired appeared on the streets; the gover- 
nor called the lay-offs a “punch to the gut”; and 


the state of Michigan pledged US$1 million to 
help the 2,100 displaced workers find new jobs. 

The pharmaceutical industry has faced major 
upheaval in recent years, with a disappointing 
drug pipeline, major revenue losses as patents 
expire on blockbuster drugs, and a spate of 
mergers and acquisitions. From 2006 through 
to the first quarter of 2012, some 263,000 
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positions have been eliminated from major 
pharmaceutical and large biotechnology com- 
panies, says Kenneth Getz, a senior research 
fellow at the Tufts Center for the Study of Drug 
Development in Boston, Massachusetts. R&D 
operations have accounted for 7-10% of the 
lay-offs since 2008, which have been only par- 
tially offset by new hiring, endangering what 
was once a stable source of jobs for life scientists 
and chemists. 

But despite such convulsions, there are posi- 
tive signs in the job market. In the Ann Arbor 
region, dozens of contract-research organiza- 
tions (CROs), many founded by former Pfizer 
employees, offer outsourced services ranging 
from medicinal chemistry to toxicology test- 
ing. The abandoned Pfizer facility has been 
reborn: the University of Michigan bought it 
and now uses some of it as research facilities 
and rents out another part to Lycera, a biotech- 
nology spin-off from the university that part- 
ners with pharmaceutical company Merck and 
employs some former Pfizer scientists. 

“What we are seeing in front of our eyes is the 
slow-motion implosion of the big pharma com- 
panies as we know them, and the rebirth of the 
industry with different models and in differ- 
ent forms,” says Bernard Munos, founder of the 
InnoThink Center for Research in Biomedical 
Innovation in Indianapolis, Indiana. 

Researchers looking for work in this 
environment need to adapt their skills to an 
industry in flux, says Munos, and consider 
how to use their experience to secure a new 
type of job. They should also be aware that 
laid-off researchers may have to take jobs 
at lower salaries at CROs or biotechnology 
start-ups, or in other industries. In short, to 
weather the cuts — which show no signs of 
abating — pharma employees and new gradu- 
ates “are going to have to hustle’, says Munos. 


R&D BREAKDOWN 
In the face of declining pharmaceutical rev- 
enues, a variety of strategies has emerged to 
increase productivity and decrease costs. Com- 
panies such as GlaxoSmithKline, headquartered 
in London, have broken up research depart- 
ments into smaller, more nimble units, and 
many firms are outsourcing R&D that would 
once have been done in-house. Meanwhile, 
research areas have been cut back. For exam- 
ple, Novartis, based in Basel, Switzerland, has 
reduced development of drugs that affect the 
central nervous system, considered a high-risk, 
expensive field (see Nature 480, 161-162; 2011). 
Firms, including Novartis, are shifting > 
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> operations to areas such as Boston, where 
they can mine academia and biotech compa- 
nies for early-stage discoveries, and China, 
an emerging market with a growing scientific 
workforce. Outsourcing may account for much 
of the net workforce reductions over the past 
several years, which occurred even as total 
investment in R&D by major pharmaceutical 
and biotechnology companies increased. An 
estimated 41,275 workers were employed in 
pharmaceutical- and biotechnology-industry 
R&D worldwide in 2010, down from 50,750 in 
2008, according to the Tufts Center (see ‘Out- 
sourcing on the rise’). 

The closures affect all workers, from labora- 
tory heads to technicians. But some jobs seem 
to be more vulnerable than others. When 
Pfizer, which is based in New York City, laid 
off employees in Ann Arbor, it offered jobs 
to hundreds of them at other locations. Most 
were scientists with transferable skills, such 
as computational biologists, or worked in hot 
areas such as oncology, says John LaMattina, 
who oversaw the lay-offs as head of global 
R&D at Pfizer and is now a senior partner at 
Puretech Ventures, a life-sciences venture-cap- 
ital company in Boston. Specialists in waning 
fields are often most vulnerable, he says. 

Bench scientists who work in the earliest 
stages of drug research may also be at high risk, 
as many pharmaceutical companies turn to aca- 
demia and biotechnology companies for leads. 
“Major R&D organizations within big pharma 
have just been slashing without a lot of regard 


OUTSOURCING ON THE RISE 


As pharma downsizes, contract-research 
organizations (CROs) have added workers. 
@ Major pharma and biotech companies 
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in drug discovery,’ says John Archer, founder 
of Catalyst Advisors in New York City, which 
recruits executives for pharmaceutical and 
biotechnology companies. People who work in 
clinical research and regulatory affairs seem to 
be better buffered from lay-offs, he says. 


STAYING AHEAD 

It is difficult to trace where the jobs are going, 
but a lot of people do manage to find work. 
In the United Kingdom, for instance, about 
2,000 chemists at pharmaceutical companies 
were laid off last year, estimates Charlotte 
Ashley-Roberts, a careers adviser at the Royal 
Society of Chemistry, based in Cambridge, UK. 


INTO CONTRACT RESEARCH 


Prepare and contrast 


Contract-research organizations (CROs) 
are increasingly taking on tasks previously 
done within the pharmaceutical industry, 
from clinical-trial management to 
medicinal chemistry. Many former 
employees of big pharmaceutical 
companies are moving to CROs — and 
liking it. “I thought the environment was 
going to be significantly different,” says 
Jim Kremidas, who made the leap in 2008. 
But the change “was not as traumatic as | 
thought it was going to be”. 

Kremidas had spent more than 20 years 
in various jobs at Eli Lilly in Indianapolis, 
Indiana, culminating in a role as head of 
patient recruitment. But as the company 
downsized in preparation for patent 
expirations, he accepted a severance 
package. Knowing what lay ahead, Kremidas 
had time to prepare. 

Before his retirement, Kremidas began 
developing his contact network. He also took 
on speaking engagements through the Drug 
Information Association, an international 
industry trade group based in Horsham, 
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Pennsylvania. The exposure helped to 

build his reputation as a leader in patient 
recruitment, he says, and in 2008 he was 
offered a job doing just that at Quintiles, a 
CRO based in Durham, North Carolina. This 
February, Kremidas became head of market 
development for digital patient recruitment 
at the firm, using the Internet and online 
patient databases to gather volunteers. 

Kremidas advises pharmaceutical 
employees who are worried about lay-offs 
to broaden their experiences to prepare 
themselves for a variety of future challenges. 
“| knew | had a skill set that was needed in 
industry, and Quintiles seemed like a logical 
place for me to land,” he says. 

Pharma and CRO work have similarities, 
but Kremidas says that CROs require 
employees to be more flexible and nimble: 
“You have a lot of different customers who 
have a lot of different ways of doing things.” 
Some customers micromanage, whereas 
others let the CRO researchers make their 
own decisions. “They respect your opinions 
and it’s more collaborative,” he says. €.8. 
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More than 85% found jobs within three months, 
and 60% in chemistry. “You are starting to see 
a whole different cadre of opportunities,’ says 
John Arrowsmith, a life-sciences adviser at 
Thomson Reuters in London. 

Many former pharmaceutical researchers 
are heading to CROs (see ‘Prepare and con- 
trast’), which have been growing steadily in the 
United States, Europe, India and China in recent 
years (see Nature 466, 280-281; 2010). In 2010, 
46,550 people were employed in R&D at CROs 
worldwide, up from 42,687 in 2008, estimates 
the Tufts Center. 

People with skills beyond bench work are 
moving into consultancy, as experts in areas 
such as regulatory affairs, clinical-trial man- 
agement and biostatistics (see “Use your skills’). 
But there are no hard numbers on who is tak- 
ing this route — or on how many consultants 
are effectively underemployed. 

Yet other workers are retooling their skills for 
related industries that remain strong, includ- 
ing development of medical food (such as 
‘gut-healthy’ yogurt), medical-device engineer- 
ing and biomanufacturing, says Clifford Minz, 
founder of BioInsights, a career consultancy in 
Princeton, New Jersey. Patent specialists and 
medical writers are also in demand, he adds. 


THE GREAT LEAP SIDEWAYS 

Alex Flood is a former pharma researcher who 
has successfully made the transition to one 
growing niche sector: non-profit work. He “cut 
his teeth” at Wyeth and weathered that com- 
pany’s 2009 buy-out by Pfizer, but for many 
years he had aspired to a job in public health. 
Since 2010, he has been employed at PATH, a 
global-health non-profit organization in Seattle, 
Washington, where he works on vaccine stabili- 
zation — by, for example, devising ways to keep 
vaccines fresh over time. To find new work, “you 
have to be flexible’, says Flood, who adapted his 
pharma training to his new job. 

Peter Corr, co-founder of Celtic Therapeutics, 
a private-equity drug-development firm in New 
York City, hires senior and junior pharmaceu- 
tical professionals with a wide range of expe- 
rience, from outsourcing to finance. He was 
head of science and technology at Pfizer until 
2006, and says that it helps if candidates have an 
understanding of the whole drug-development 
pipeline. “Spend some time in your off hours in 
other parts of the company, he advises potential 
applicants. For example, bench scientists should 
expand their skill sets by learning about regu- 
latory affairs or business development. Being 
open to relocation also helps, says Archer, given 
the geographic shifts in the industry. 

The best candidates show passion for what 
they do and have taken on challenges, says 
Corr. “You see people who are moving to gain 
new experiences,’ he says. “These people are 
constantly stretching themselves? m 


Charlotte Schubert is a freelance journalist 
based in Seattle, Washington. 
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CONSULTANCY 


Use your skills 


In many ways, Beat Widler was 
ideally placed to start a consultancy. 
He had spent decades in regulatory 
affairs and clinical research at Roche, 
the pharmaceutical firm based in 
Basel, Switzerland. Most recently, he 
was global head of clinical quality, 
ensuring that clinical trials protected 
human subjects and maintained 
data integrity. Now, he is a consultant 
in the same area. 

Working out of his home in Zug, 
Switzerland, Widler takes advantage 
of a network of contacts in the 
pharmaceutical industry, contract- 
research organizations and regulatory 
agencies. Even so, setting up a 
company was risky. “If we are able 
to break even this year we can be 
extremely proud of ourselves,” he says. 

Widler had been thinking for 
years about starting a company. 
When Roche offered him an early- 
retirement package in 2011, he 
took the plunge, setting up Widler 
& Schiemann with former Roche 
colleague Peter Schiemann this year. 

Widler says that the shifts in the 
industry are making it easier for 
former pharma employees to set up 
shop, as big companies and small 
biotechs turn to an outsourcing 
model with low overhead costs. 

And cost pressures are leading 
companies to rely on experts to help 
them trim the fat from their clinical 
trials, while keeping standards high. 

Widler’s network includes 
connections at professional 
organizations such as the European 
Forum for Good Clinical Practice in 
Brussels and the Drug Information 
Association, based in Horsham, 
Pennsylvania, where he has served 
on committees and given talks. That 
experience, he says, helped him to 
build his reputation and meet clients. 

Without the infrastructure of a 
large organization, Widler has had to 
adapt. For example, he spent hours 
creating the template for a form for 
auditing a client. “You do everything 
from scratch. It’s pretty intense, but 
it’s pretty fun,” he says. 

He has no regrets and remains 
optimistic, but is mindful of how long 
his personal funds can last while 
he builds up his business. “Be very 
realistic about finances,” he says. “It 
is critical to do your homework.” 6.8. 


TURNING POINT 
Jim Hoch 


In April, Jim Hoch, a molecular biologist at 
the Scripps Research Institute in San Diego, 
California, celebrated the ninth renewal of 

the grant supporting his study of bacterial 
signalling proteins. Here, he reflects on how his 
efforts to unravel sporulation led to a three- 
decade US National Institutes of Health (NIH) 
grant — one of the longest-running at Scripps. 


How did your research get started? 

I came to Scripps after an NIH-funded 
postdoc at the Molecular Genetics Centre in 
Gif-sur-Yvette, France, where I learned about 
mapping genetics in bacteria. Once here, I 
applied the technique to begin to sort out the 
mechanisms that trigger sporulation, the pro- 
cess by which bacteria or fungi suspend their 
growth to form tough, seed-like spores. Since 
we started, my team and I have learned about 
the genes and proteins involved, but we are still 
piecing together how it works. The signalling 
mechanisms are a mess to unravel. 


Was your first grant, to study the bacterium 
Bacillus subtilis, a turning point in your career? 
I owe my career to this grant from the US 
National Institute of General Medical Science. 
I’ve been told that you should have several 
grants, but I had just one that I bundled every- 
thing into. If] had lost that grant at any renewal, 
I would have been dead. The renewal process is 
fairly traumatic, but it has motivated me to work 
hard each grant cycle. Because Scripps started 
hosting graduate students only recently, I have 
employed technicians, various undergraduates 
and some postdocs. And we managed to do 
pretty amazing things. 


How have modern technologies influenced 

your research? 

The evolution of technology has driven the 
experiments. I started with genetics, but molec- 
ular cloning and DNA sequencing changed 
everything — letting us find out about the pro- 
teins encoded by the genes. Using biochemistry, 
we could work out their functions. From there we 
used crystallography and nuclear magnetic reso- 
nance to establish the structure of the proteins. 
Most recently, we have been working with statis- 
tical physicists to determine how these proteins 
interact. Every year is a complete learning 
process — a quantum leap from one technology 
to the next. It has been a hell of a journey. 


How have NIH requirements changed? 

Proposals used to be more than 20 pages long, 
and the study sections that review grants lasted 
for three days. Now, proposals are 12 pages, 
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study sections last one day and half of the 
applications, the less-impressive ones, are not 
even discussed. There is a lesson here. Applica- 
tions of 12 pages need to be clear and concise 
to make them understandable outside the field. 
Most importantly, they need to be exciting to 
read. A proposal needs to have clearly articu- 
lated goals that transmit your excitement. 


Have you ever thought that your grant 
wouldn’t get renewed? 

Ithas become more difficult over the years; there 
is more competition. I was worried in the latest 
round. I broke my leg and was recovering for a 
year. Normally, I have five or six publications a 
year, so that every four years, when it is time for 
renewal, I have 20-25 papers showing my pro- 
gress. I didn’t have that this time, but I squeaked 
by with a few good papers in good journals. 


Your career has been mostly in basic research. 
Have there been any interesting applications? 
There have been some spin-offs. When I 
first came to Scripps, I was working on genes 
involved in the hyperproduction of proteo- 
lytic enzymes. One of my postdocs ended up 
as an executive at the biotechnology company 
Genentech, based in South San Francisco, 
California. He recognized that proteases could 
be important in the production of detergent. 
The proteases in most US soaps come from 
different species of Bacillus, and from some 
hyperproduction genes that we discovered. 
A company was spun off: Genencor, which is 
now owned by Dupont. It has produced more 
than US$1 billion’s worth of enzymes. m 
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